(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPER/VTION TREATY (PCT) 



(19) World IntcHectual Property /^^^ 

^n.^r:!^VLu ^^lillillllllllllilllllllilllllllllli 

(43) international Fublieation Date (10) International Publication Number 

3 June 2004 (03.06.2004) PCT WO 2004/045543 A2 



(51) International Patent Classillcatinn^: 



(25) Filing Language: linglish 

(26) Publication Language: Knglisti 

(30) Priority Data: 

60/426,137 14 November 2002 (14.1 1.2002) US 

60/502 ,050 1 0 Seplember 2003 ( 1 0.09.2003) US 

(71) Applicant: DHARMACON, INC. fUSAJSl; 2650 Crc- 
cenl Drive, Suite 100, Lafayette, CO 80026 (US). 

(72) Inventors: ANASTASIA, Klivorova; 635 Walden 
[ #204, Denver, CO (US). ANGELA, Reynolds; I 1445 
I Conifer Ridge Drive, Conifer CO 80433 (US). DEVIN, 
I Lcakc; 3050 Krameria Street, Denver, CO 80303 (US). 
: WILLIAM, Marshall; 495 Mohawk Drive, Denver, 
I CO 80303 (US). Stephen, Scarlnge; 2746 Pairie Ridge, 
: Lafayette, CO 80026 (US). 



(81) Designated States (naiioiml): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GO, GE. GH, 
OM, IIR, IIU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, IX, 
LK, LR, LS, LI, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NI. NO, NZ, OM, PG, PH, PL, PT, RO, RU, SC, 
SD, .Sli, SO, SK, S)., SY, T.1, TM, TN, TR, Tl", TZ, UA, 
UG, UZ, VC, VN. YU, ZA, ZM, ZW. 

(84) Designated Stales (regional): ARIIO patent (BW, GlI, 
GM, KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurdiiian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CI 1, CY, CZ, DE, DK, EE, 
ES, PI, LR, GB, GR, HU, IE, I T, LU, MC, NL, Ff, RO, SE, 
SI, SK, TR), OAPI patent (BF, BJ, CE, CG, CI. CM, GA, 
GN, GQ, GW, ML, MR, NI'„ SN, TD, TG). 



without international search report and to be republished 
upon receipt of that report 

I Continued on next page) 



i (54) Title: FUNCTIONAL ANI5 HYPERnJNCTIONAL SIRNA 



!, l!'iiiiniii(iiiiiiiii "i! -4-"^ 



It 




O (57) Abslra 
^ siRNAsbyr 
^ genes. 



:l: EfTicicnt sequence spccilic gene silencing is possible lhn)ugh (he use ofsiRNA lechnulugy. By selecting particular 
iiionale design, one can maximize the generation of an effective gene silencing reagent, as well as methods for silencing 



wo 2004/045543 A2 llllllllllllllilillillllllllllilllllllllllllllllllllill^ 



with sequence listing part of description published sepa- 
rately in electronic form and available upon request from 
the International Bureau 



For two-leuer codes and mher ahhreviations. refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCI' Gazette. 



wo 2004/045543 



1 



PCT/IIS2003/036787 



Functional and Hyperfunctional siRNA 

/ 

5 Cross Reference to Related Applications 

This application claims the benefit of the filing date of U.S. Provisional 
Application Serial No. 60/426,137, filed November 14, 2002, entitled "Combinatorial 
Pooling Approach for siRNA Induced Gene Silencing and Methods for Selecting 
siRNA," and U.S. Provisional Application Serial No. 60/502,050, filed September 10, 
1 0 2003, entitled "Methods for Selecting siRNA," the entire disclosures of which are 
hereby incorporated by reference into the present disclosure. 

Field of Invention 

The present invention relates to RNA interference ("RNAi"). 

15 

Background of the Invention 

Relatively recently, researchers observed that double stranded RNA 
("dsRNA") coxild be used to inhibit protem expression. This ability to silence a gene 
has broad potential for treating human diseases, and many researchers and 
20 commercial entities are currency investing considerable resources in developing 
therapies based on this technology. 

Double stranded JRNA induced gene silencing can occur on at least three 
different levels: (i) transcription inactivation, which refers to RNA guided DNA or 
25 histone methylation; (ii) siRNA induced mRNA degradation; and (iii) mRNA induced 
transcriptional attenuation. 



It is generally considered that the major mechanism of RNA induced silencing 
(RNA interference, or RNAi) in mammalian cells is mRNA degradation. Initial 
30 attempts to use RNAi in mammalian cells focused on the use of long strands of 

dsRNA. However, these attempts to induce RNAi met with limited success, due in 
part to the induction of the interferon response, which results in a general, as opposed 
to a target-specific, inhibition of protein synthesis. Thus, long dsRNA is not a viable 
option for RNAi in mammalian systems. 
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More recently it has been shown that when short (1 8-30 bp) RNA duplexes are 
introduced into mammalian cells in culture, sequence-specific inhibition of target 
mRNA can be realized without inducing an interferon response. Certain of these 

5 short dsRNAs, referred to as small inhibitory RNAs ("siRNAs"), can act catalytically 
at sub-molar concentrations to cleave greater than 95% of the target mRNA in the 
cell. A description of the mechanisms for siRNA activity, as well as some of its 
applications are described in Provost et al, Ribonuclease Activity and RNA Binding of 
Recombinant Human Dicer, E.M.B.O. J., 2002 Nov. 1 ; 21(21): 5864 -5874; Tabara et 

10 al., The dsRNA Binding Protein RDE-4 Interacts with RDE-1, DCR-1 and a DexH- 
box Helicase to Direct RNAi in C. elegans, Cell 2002, June 28;109(7):861-71 ; Ketting 
et al.. Dicer Functions in RNA Interference and in Synthesis of Small RNA Involved in 
Developmental Timing in C. elegans; Martinez et al, Single-Stranded Antisense 
SiRNAs Guide Target RNA Cleavage in RNAi, Cell 2002, Sept. 6; 110(5):563; 

1 5 Hutvagner & Zamore, A microRNA in a multiple-turnover RNAi enzyme complex. 
Science 2002, 297:2056. 

From a mechanistic perspective, introduction of long double stranded RNA 
into plants and invertebrate cells is broken down into siRNA by a Type III 

20 endonuclease known as Dicer. Sharp, RNA interference— 2001, Genes Dev. 2001, 
15:485. Dicer, a ribonuclease-III-like enzyme, processes the dsRNA into 19-23 base 
pair short interfering RNAs with characteristic two base 3' overhangs. Bernstein, 
Caudy, Hammond, & Harmon, Role for a bidentate ribonuclease in the initiation step 
of RNA interference, Nahire 2001, 409:363. The siRNAs are then mcorporated into 

25 an RNA-induced silencing complex (RISC) where one or more helicases unwind the 
siRNA duplex, enabling the complementary antisense strand to guide target 
recognition. Nykanen, Haley, & Zamore, /IIP requirements and small interfering 
RNA structure in the RNA interference pathway, Cell 2001 , 1 07:309. Upon binding to 
the appropriate target mRNA, one or more endonucleases within the RISC cleaves the 

30 target to induce silencing. Elbashir, Lendeckel, & Tuschl, RNA interference is 
mediated by 21- and 22-nucleotide RNAs, Genes Dev 200 1 , 1 5 : 1 88, Figure 1 . 

The interference effect can be long lasting and may be detectable after many 
cell divisions. Moreover, RNAi exhibits sequence specificity. Kisielow, M. et al. 
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(2002) Isoform-specific knockdown and expression of adaptor protein ShcA using 
small interfering KNA, J. of Biochemistry 363: 1-5. Thus, the RNAi machinery can 
specifically knock down one type of transcript, while not affecting closely related 
mRNA. These properties make siRNA a potentially valuable tool for inhibiting gene 
5 expression and studying gene function and drug target validation. Moreover, siRNAs 
are potentially useful as therapeutic agents against: (1) diseases that are caused by 
over-expression or misexpression of genes; and (2) diseases brought about by 
expression of genes that contain mutations. 

1 0 Successfiil siRN A-dependent gene silencing dqjends on a number of factors. 

One of the most contentious issues in RNAi is the question of the necessity of siKNA 
design, i.e., considering the sequence of the siRNA used. Early work in C. elegans 
and plants circumvented the issue of design by introducing long dsRNA (see, for 
instance. Fire, A. et al. (1998) Nature 391:806-81 1). In this primitive organism, long 
1 5 dsRNA molecules are cleaved into siRNA by Dicer, thus generating a diverse 

population of duplexes that can potentially cover the entire transcript. While some 
fraction of these molecules are non-functional (i.e. induce little or no silencing) one or 
more have the potential to be highly ftinctional, thereby silencing the gene of interest 
and alleviating the need for siRNA design. Unfortunately, due to the interferon 
20 response, this sjohe approach is unavailable for mammalian systems. While this effect 
can be circumvented by bypassing the Dicer cleavage step and directly introducing 
siRNA, this tactic carries with it the risk that the chosen siRNA sequence may be non- 
functional or semi-functional. ■■■■■■ • " 

25 A number of researches have expressed the view that siRNA design is not a 

crucial element of RN Ai. On the other hand, others in the field have begun to explore 
the possibility that RNAi can be made more efficient by paying attention to the design, 
of the siRNA. Unfortunately, none of the reported methods have provided a 
satisfactory scheme for reliably selecting siRNA with acceptable levels of 

30 ftmctionality. Accordingly, there is a need to develop rational criteria by which to 
select siRNA with an acceptable level of functionality, and to identify siRNA that 
have this improved level of functionality, as well as to identify siRNAs that are 
hyperfimctional. 



wo 2004/045543 PCT/US2003/036787 
4 

Summary of the Invention 

The present invention is directed to increasing the efficiency of RNAi, 
particularly in mammalian systems. Accordingly, the present invention provides kits, 
siRNAs and methods for increasing siRNA efficacy. 

5 

According to one embodiment, the present invention provides a kit for gene 
silencing, wherein said kit is comprised of a pool of at least two siRNA duplexes, 
each of which is comprised of a sequence that is complementary to a portion of the 
sequence of one or more target messenger RNA. 

10 

According to a second embodiment, the present invention provides a method 
for optimizing RNA interference by using one or irtore siRNAs that are optimized 
according to a formula (or algorithm) selected from: 
Formula I 

15 Relative ftinctionality of siRNA= -(GC/3) +(AUi5.i9) -(Tm2o»c)*3 -{Gi3)*3 -(C19) 
+(A,9)*2 +(A3) +(Uio)+(A,4) -(Us) -(All) 

Formula II 

Relative functionality of siRNA= -(GC/3) -(AUi5-i9)*3 -(G,3)*3 ^ds) +(A,9)*2 
20 +(A3) .--r^^ 

Formula III 

Relative functionality of siRNA= -(GC/3) +(AUis-i9) -(Tm2o°c)*3 - 

25 Formula IV 

Relative functionality of siRNA= 

-GC/2+( AU,5-i9)/2-(Tm2o«c)*2 -(G,3)*3 -(C.s) +(A,9)*2 +(A3) +(U,o)+(Ai4) -(U5) - 
(An) 

30 Formula V 

Relative fimctionality of siRNA=-(G,3)*3 -(C,?) +(Ai9)*2 +(A3) + CUio)+(Ai4) -(Us) 
-(An) 

Formula VI 
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Relative functionaUty of siRNA=-(Gi3)*3 -(C,?) +(Ai9)*2 +(A3) 
Formula VII 

Relative functionality of siRNA=-(GC/2) +(AUis-i9)/2 -( Tm2o°c)*l -(Gi3)*3 -{C,,) 
5 +(A,9)*3 +(A3)*3 +(U,o)/2+(A,4)/2 -(U5)/2 -(A, ,)/2 

wherein in Formulas I - VII: 

Tm 20°c=l if the Tm is greater than 20°C; 
10 Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its value 

is 0; 

AUi5-i9 = 0-5 depending on the number of A'-Or U bases on the sense strand 

at 

positions 15-19; 

15 Gi3 = 1 if G is the base at position 1 3 on the sense strand, otherwise its value is 

0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its value is 

0; 

GC= the number of G and C bases in the entire sense strand; 
20 A3= 1 if A is the base at position 3 on the sense strand, otherwise its value is 0; 

Ai 1= 1 if A is the base at position 11 on the sense strand, otherwise its value is 

0; 

Ai4= 1 if A is the base at position 14 on the sense strand; otherwise its value is 

0; 

25 Uio= 1 if U is the base at position 10 on the sense strand, otherwise its value is 

0; 

U5= 1 if U is the base at position 5 on the sense strand, otherwise its value is 0; 

or 

30 Formula VIII Relative functionality of siRN A = 

(-14)*Gi3-13*A,-12*U7-ll*U2-10*A„-10*U4-10*C3-10*C5-10*C6- 
9*Aio-9*U9-9*Ci8-8*Gio-7*Ui-7*Ui6-7*C,7-7*C,9 
+7*Ui7+8*A2+8*A4 +8*A5+8*C4 +9*G8 +1 0*A7+1 0*U,8+1 1 *Ai9+ 
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11*C9+15*G,+ 18*A3+19*U,o-Tni-3* (GQo,ai)- 6*(GC,5-i9)- 
30*X; and 

Formula IX Relative functionality of siRNA = 
5 (14.1)*A3+(14.9)*A6+(17.6)*A,3+(24.7)*A,9+(14.2)*Uio+(10.5)* 
C9+(23.9)*G,+(16.3)*G2+(-12.3)*A„+(-19.3)*U,+(-12.1)*U2+ 
(-ll)*U3+(-15.2)*U,5+(-11.3)*U,5+(-11.8)*C3+(-17.4)*C6+(- 
10.5)*C7+ (-13.7)*Gi3+(-25.9)*G,9-Tm-3* (GQotai) ~ 6*(GCi5-i9)- 
30*X 

10 wherein 

Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, btherwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 

A4= 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
15 A5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 

A6= 1 if A is the base at position 6 of the sense strand, otherwise its value 'is 0; 

A7 = 1 if A is the base at position ? of the sense strand, otherwise its value is 0; 

Aio= 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 

An = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 
20 Ai3 = 1 if A is the base at-p'osition 1 3 of the sense strand, otherwise its value is 0; 

Ai9= 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

C3 = 1 if Cis the base at position 3 of the sense strand, otherwise its value is 0; 
25 C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 

Cs = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 

Ce = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
30 Cn = 1 if C is the base at position 1 7 of the sense strand, otherwise its value is 0; 

C18 = 1 if C is the base at position 1 8 of the sense strand, otherwise its value is 0; 

Ci9 = 1 if C is the base at position 1 9 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 
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Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio= 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
5 G 13 = 1 if G is the base at position 1 3 on the sense strand, otherwise its value is 0; 
Gi9= 1 if G is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
10 U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 

U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 

U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 

U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 

U9 = lif U is the base at position 9 on the sense strand, otherwise its value is 0; 
15 Uiq = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 

Ui5 = 1 if U is tiie base at position 15 on the sense strand, otherwise its value is 0; 

U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 

Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 

U18 = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 
20 ■ 

GCi5_i9 = the number of G and C bases within positions 15 - 19 of the sense strand 
or within positions 15 -18 if the sense sh^ind is only 18 base pairs in length; 

GCtotai = the number of G and C bases in the sense strand; 

Tm = 1 00 if the targeting site contains an inverted repeat longer than 4 base pairs, 
25 otherwise its value is 0; and 

X = the number of times that the same nucleotide repeats four or more times in a row. 

According to a third embodiment, the present invention is directed to a kit 
comprised of at least one siRNA that contains a sequence that is optimized according 
30 to one of the formulas above. Preferably the kit contains at least two optimized 
siRNA, each of which comprises a duplex, wherein one strand of each duplex 
comprises at least eighteen contiguous bases that are complementary to a region of a 
target messenger RNA. For mammalian systems, the siRNA preferably comprises 
between 18 and 30 nucleotide base pairs. 
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The ability to use the above algorithms, which are not sequence or species 
specific, allows for the cost-effective selection of optimized siRNAs for specific 
target sequences. Accordingly, there will be both greater efficiency and rehability in 
5 the use of siRNA technologies. 

According to a fourth embodiment, the present invention provides a method 
for developing an siRNA algorithm for selecting fimctional and hyperfimctional 
siRNAs for a given sequence. The method comprises: 
10 (a) . selecting a set of siRNAs; 

(b) measuring the gene silencing ability of each siRNA fi-om said set; 

(c) determining the relative fimctionality of each siRNA; 

(d) determining the amoimt of improved functionality by the presence or 
absence of at least one variable selected from the group consisting of 

1 5 the total GC content, melting temperature of the siRNA, GC content at 

positions 15-19, the presence or absence of a particular nucleotide at a 
particular position and the number of times that the same nucleotide 
repeats within a given sequence; and 

(e) developing an algorithm using the information of step (d). 
20 ^ - 

According to this embodiment, preferably the set of siRNAs comprises at least 
90 siRNAs from at least one gene, more preferably at least 180 siRNAs from at least 
two different genes, and most preferably at least 270 and 360 siRNAs from at least 
three and four different genes, respectively. Additionally, in step (d) the 
25 determination is made with preferably at least two, more preferably at least three, 
even more preferably at least four, and most preferably all of the variables. The 
resulting algorithm is not target sequence specific. 

In a fifth embodiment, the present invention provides rationally designed 
30 siRNAs identified using the formulas above. 



In a sixth embodiment, the present invention is directed to hyperfimctional 

siRNA. 



wo 2004/045543 PCT/US2003/036787 
9 

For a better understanding of the present invention together with other and 
further advantages and embodiments, reference is made to the following description 
taken in conjunction with the examples, the scope of which is set forth in the 

appended claims. 

5 

Brief Description of the Figures 

Figure 1 shows a model for siRNA-RlSC interactions. RISC has the ability to interact 
with either end of the siRNA or miRNA molecule. Following binding, the duplex is 
unwound, and the relevant target is identified, cleaved, and released. 

10 

Figure 2 is a representation of the fimctionality of two hundred and seventy siRNA 
duplexes that were generated to target human cyclophilin, human diazepam-binding 
inhibitor (DB), and firefly luciferase. 

1 5 Figure 3a is a representation of the silencing effect of 30 siRNAs in three different 
cells lines, HEK293, DU145, and Hela. Figure 3b shows the firequency of different 
fiinctional groups (>95% silencing (black), >80% silencing (gray), >50% silencing 
(dark gray), and <50% silencing (white)) based on GC content. In cases where a given 
bar is absent jBx>m a particular GC percentage, no siRNA were identified for that 

20 particular group. Figure 3c shows the frequency of different functional groups based 
on melting temperature (Tm), Again, each group has four different divisions: >95% 
(black), >80% (gray), >50% (dark gray), and <50% (white) silencing. 

Figure 4 is a representation of a statistical analysis that revealed correlations between 
25 silencing and five sequence-related properties of siRNA: (A) an A at position 19 of 
the sense strand, (B) an A at position 3 of the sense strand, (C) a U at position 10 of 
the sense strand, (D) abase other than G at position 13 of the sense strand, and (E) a 
base other than C at position 19 of the sense strand. All variables were correlated with 
siRNA silencing of firefly luciferase and human cyclophilin. SiRNAs satisfying the 
30 criterion are grouped on the left (Selected) while those that do not, are grouped on the 
right (Eliminated). Y-axis is "% Silencing of Control." Each position on the X-axis 
represents a unique siRNA. 
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Figures 5 A and 5 B are representations of firefly luciferase and cyclophilin siRNA 
panels sorted according to functionality and predicted values using Formula VIII. The 
siRNA found within the circle represent those that have Formula VIII values 
(SMARTscores™) above zero. SiRNA outside the indicated area have calculated 
5 Fomiula VIII values that are below zero. Y-axis is "Expression (% Control)." Each 
position on the X-axis represents a unique siRNA. 

Figure 6A is a representation of the average internal stability profile (AISP) derived 
fi-om 270 siRNAs taken from three separate genes (cyclophilin B, DBI and firefly 
10 luciferase). Graphs represent AISP values of highly functional, functional, and non- 
functional siRNA. Figure 6B is a comparison between the AISP of naturally derived 
GPP siRNA (filled squares) and the AISP of siRNA from cyclophilin B,-DBI, and 
luciferase having >90% silencing properties (no fill) for the antisense strand. "DG" is 
the symbol for AG, firee energy. 

15 

Figure 7 is a histogram showing the differences in duplex fimctionality upon 
introduction of basepair mismatches. The X-axis shows the mismatch introduced inot 
the siRNA and the position it is introduced (e.g., 8C->A reveals that position 8 (which 
normally has a C) has been changed to an A). The Y-axis is "% Silencing 
20 (Normailized to Control)." 

Figure 8a is histogram that shows the effects of 5 'sense and antisense strand 
- - - modification with 2'-0-methylation on fvincfionality. Figure 8b is an expression • 

profile showing a comparison of sense strand off-target effects for IGFlR-3 and 2'-0- 
25 methyl IGFlR-3. Sense strand off-targets (lower white box) are not induced when the 

5* end of the sense strand is modified with 2'-0-methyl groups (top white box). 

Figure 9 shows a graph of SMARTscores™ versus RNAi silencing values for more 
than 360 siRNA directed against 30 different genes. SiRNA to the right of the vertical 
30 bar represent those siRNA that have desirable SMARTscores™. 

Figures lOA - E compare the RNAi of five different genes (SEAP, DBI, PLK, 
Firefly Luciferase, and Renila Luciferase) by varying numbers of randomly selected 
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siRNA and four rationally designed (SMART-selected) siRNA chosen using the 
algorithm described in Formula VEI. In addition, RNAi induced by a pool of the four 
SMART-selected siRNA is reported at two different concentrations (100 and 400nM). 
lOF is a comparison between a pool of randomly selected EGFR siRNA (Pool 1) and 
5 apool of SMART selected EGFR siRNA (Pool 2). Pool 1, SI— S4 and Pool 2 SI— 
S4 represent the individual members that made up each respective pool. Note that 
numbers for random siRNAs represent the position of the 5' end of the sense strand of 
the duplex. The Y-axis represents the % expression of the control(s). The X-axis is 
the percent expression of the control. 

10 

Figure 11 shows the Western blot results from cells treated with siRNA directed 
against twelve different genes involved in the clathrin-dependent endocytosis ' 
pathway (CHC, Dynll, CALM, CLCa, CLCb, Epsl5, EpslSR, Rab5a, Rab5b, RabSc, 
P2 subunit of AP-2 and EEA.1). SiRNA were selected using Formula VIII. "Pool" 
1 5 represents a mixture of duplexes 1 -4. Total concentration of each siRNA in the pool , 
is 25 nM. Total concentration = 4 x 25 = ICQ nM. 

Fig;iire 12 is a representation of the gene silencing capabilities of rationally-selected 
siRNA directed against ten different genes (human and mouse cyclophilin, C-myc, 
20 human lamin A/C, QB (ubiquinol-cytochrome c reductase core protein I), MEKl and 
MEK2, ATEl (arginyl-tRNA protein transferase), GAPDH, and Eg5). The Y-axis is 
the percent expression of the control. Numbers 1, 2, 3 and 4 represent individual 
rationally selected siRNA. "Pool" represents a mixture of the four individual siRNA. 

25 Figure 13 is the sequence of the top ten Bcl2 siRNAs as determined by Formula VIII. 
Sequences are listed 5' to 3'. 

Figure 14 is the knockdown by the top ten Bcl2 siRNAs at lOOnM concentrations. 
The Y-axis represents the amount of expression relative to the non-specific (ns) and 
30 transfection mixture control. 



Figure 15 represents a functional walk where siRNA beginning on every other base 
pair of a region of the luciferase gene are tested for the ability to silence the luciferase 
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gene. The Y-axis represents the percent expression relative to a control. The X-axis 
represents the position of each individual siRNA. 



Figure 16 is a histogram demonstrating the inhibition of target gene expression by 
5 pools of 2 and 3 siRNAs duplexes taken from the walk described in Figure 1 5. The 
Y-axis represents the percent expression relative to control. The X-axis represents the 
position of the first siRNA in paired pools, or trios of siRNA. For instance, the first 
paired pool contains siRNA 1 and 3. The second paired pool contains siRNA 3 and 5. 
Pool 3 (of paired pools) contains siRNA 5 and 7, and so on. 

10 

figure 17 is a histogram demonstrating the inhibition of target gene expression by 
pools of 4 and 5 siRNA duplexes. The Y-axis represents the percent expression- 
relative to a control. The X-axis represents the positoin of the first siRNA in each 
pool. 

15 

Figure 18 is a histogram demonstrating the inhibition of target gene expression by 
siRNAs that are ten and twenty basepairs apart. The Y-axis represents the percent 
expression relative to a control. The X-axis represents the position of the first siRNA 
in each pool. 

20 

Figure 19 shows that pools of siRNAs (dark gray bar) work as well (or better) than 
the best siRNA in the pool (light gray bar). The Y-axis represents the percent 
« expression relative to a control. The-X axis represents the position of the first siRNA 
in each pool. 

25 

Figure 20 shows that the combination of several semifunctional siRNAs (dark gray) 
result in a significant improvement of gene expression inhibition over individual 
(semi-functional; light gray) siRNA. The Y-axis represents the percent expression 
relative to a control. 

30 

Figure 21 shows both pools (Library, Lib) and individual siRNAs in inhibition of 
gene expression of Beta-Galactosidase, Renilla Luciferase and SEAP (alkaline 
phosphatase). Numbers on the X-axis indicate the position of the 5'-most nucleotide 
of the sense strand of the duplex. The Y-axis represents the percent expression of 
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each gene relative to a control. Libraries contain siRNAs that begin at the following 
nucleotides: Seap: Lib 1: 206, 766, 812,923, Lib 2: 1117, 1280, 1300, 1487, Lib 3: 
206, 766, 812, 923, 1117, 1280, 1300,1487, Lib 4: 206, 812, 1117, 1300, Lib 5: 766, 
923, 1280, 1487, Lib 6: 206, 1487; Bgal: Lib 1: 979, 1339, 2029, 2590, Lib 2: 
5 1087,1783,2399,3257, Lib 3: 979, 1783, 2590, 3257, Lib 4: 979, 1087, 1339, 1783, 
2029, 2399,2590,3257, Lib 5: 979, 1087, 1339, 1783, Lib 6: 2029,2399,2590,3257; 
Renilla: Lib 1: 174,300,432,568, Lib 2: 592, 633, 729,867, Lib 3: 174, 300, 432, 
568, 592, 633,729,867, Lib 4: 174, 432, 592, 729, Lib 5: 300,568,633,867, Lib 6: 
592,568. 

10 

Figure 22 showS the results of an EGFR and TfinOR internalization assay when single 
gene knockdowns are performed. The Y-axis represents percent internalization 
relative to control. 

15 

Figure 23 shows the results of an EGFR and TfiiR internalization assay when 
multiple genes are knocked down (e.g. RabSa, b, c). The Y-axis represents the 
percent internalization relative to control. 

20 Figure 24 shows the simultaneous knockdown of four different genes. SiRNAs 

directed against G6PD, GAPDH, PLK, and UBQ were simultaneously introduced into 
cells. Twenty-four hours later, cultures were harvested and assayed for mRNA target 
: levelsifor-each of the four genes. A comparison is made between cells transfected 
with individual siRNAs vs. a pool of siRNAs directed against all four genes. 

25 

Figure 25 shows the functionality of ten siRNAs at 0.3nM concentrations. 

Detailed Description 
Definitions 

30 Unless stated otherwise, the following terms and phrases have the meanings 

provided below: 
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siRNA 

The term "siRNA" refers to small inhibitory RNA duplexes that induce the 
RNA interference (RNAi) pathway. These molecules can vary in length (generally 
between 1 8-30 basepairS) and contain varying degrees of complementarity to their 
5 target mRNA in the antisensesti'and. Some, but not all, siRNA have unpaired 

overhanging bases on the 5' or 3' end of the sense strand and/or the antisense strand. 
The term "siRNA" includes duplexes of two separate strands, as well as single strands 
that can form hairpin structures comprising a duplex region. 

1 0 SiRNA may be divided into five (5) groups (non-functional, semi-functional, 

functional, higjily functional, and hyper-functional) based on the level or degree of 
silencing that they induce in cultured cell lines. As used herein, these definitions are 
based on a set of conditions where the siRNA is transfected into said cell hne at a 
concentration of lOOnM and the level of silencing is tested at a time of roughly 24 

1 5 hours after transfection, and not exceeding 72 hours after transfection. In this context, 
"non-flinctional siRNA" are defined as those siRNA that induce less than 50% 
(<50%) target silencing. "Semi-functional siRNA" induce 50-79% target silencing. 
"Functional siRNA" are molecules that induce 80-95% gene silencing. "Highly- 
functional siRNA" are molecules that induce greater than 95% gene silencing. 

20 "Hyperfiinctional siRNA" are a special class of molecules: -For purposes of this 
document, hyperfimctional siRNA are defined as those molecules that: (1) induce 
greater than 95% silencing of a specific target when they are transfected at 
subnanomolar concentrations (i.e., less than one nanomolar); and/or (2) induce 
fimctional (or better) levels of silencing for greater than 96 hours. These relative 

25 functionalities (though not intended to be absolutes) may be used to compare siRNAs 
to a particular target for applications such as functional genomics, target identification 
and therapeutics. 

miRNA 

30 The term "miRNA" refers to microRNA. 



Gene silencing 

The phrase "gene silencing" refers to a process by which the expression of a 
specific gene product is lessened or attenuated. Gene silencing can take place by a 
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variety of pathways. Unless specified otherwise, as used herin, gene silencing refers 
to decreases in gene product expression that results fi-om RNA interference (RNAi), a 
defined, though partially characterized pathway whereby small inhibitory RNA 
(siRNA) act in concert with host proteins (e.g. the RNA induced silencing complex, 

5 RISC) to degrade messenger RNA (mRNA) in a sequence-dependent fashion. The 
. level of gene silencing can be measured by a variety of means, including, but not 
limited to, measurement of transcript levels by Northern Blot Analysis, B-DNA 
techniques, transcription-sensitive reporter constructs, expression profiling {e.g. DNA 
chips), and related technologies. Altematively, the level of silencing can be measvired 

D by assessing the level of the protein encoded by a specific gene. This can be 
accomplished by performing a number of studies including Western Analysis, 
measuring the levels of expression of a reporter protein that has e.g. fluorescent 
properties (e.g. GFP) or enzymatic activity (e.g. alkaline phosphatases), or several 
other procedures. 

5 ■ 

Transfection 

The term "transfection" refers to a process by which agents are introduced into 
a cell. The list of agents that can be transfected is large and includes, but is not 
limited to, siRNA, sense and/or anti-sense sequences, DNA encoding one or more 
0 genes and organized into an expression plasmid, proteins, protefn fi-agments, and 

more. There are multiple methods for transfecting agents into a cell including, but not 
limited to, electroporation, calcium phosphate-based transfections, DEAE-dextran- 
based hsansfecldens, lipid-based transfections, molecular conjugate-based transfections 
{e.g. polylysine-DNA conjugates), microinjection and others. 



The term "target" is used in a variety of different forms throughout this 
docimient and is defined by the context in which it is used. "Target mRNA" refers to 
a messenger RNA to which a given siRNA can be directed against. "Target sequence" 
and "target site" refer to a sequence within the mRNA to which the sense strand of an 
siRNA shows varying degrees of homology and the antisense strand exhibits varying 
degrees of complementarity. The term "siRNA target" can refer to the gene, mRNA, 
or protein against which an siRNA is directed. Similarly "target silencing" can refer 
to the state of a gene, or the corresponding mRNA or protein. 
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Off-target silencing and Off-target interference 

The phrases "off-target silencing" and "off-target interference" are defined as 
degradation of mRNA other than the intended target roRNA due to overlapping and/or 
5 partial homology with secondary mRNA messages. 

SMARTscore™ 

The term "SMARTscore'^'*^" refers to a number determined by applying any of 
the Formulas I - Formula DC to a given siRNA sequence. The term "SMART- 
1 0 selected" or "rationally selected" or "rational selection" refers to siRNA that have 
been selected on the basis of their SMARTscores™. 



Complementary 

The term "complementary" refers to the ability of polynucleotides to form 
15 base pairs with one another. Base pairs are typically formed by hydrogen bonds 
between nucleotide units in antiparallel polynucleotide strands. Complementary 
polynucleotide strands can base pair in the Watson-Crick manner {e.g., A to T, A to 
U, C to G), or in any other manner that allows for the formation of duplexes. As 
persons skilled in the art are aware, vfhea using KNA as opposed to DNA, uracil 
20 rather than thymine is the base that is considered to be complementary to adenosine. 
However, when a U is denoted in the context of the present invention, the ability to 
substitute a T is implied, unless otherwise stated. 

Perfect complementarity or 100% complementarity refers to the situation in 
25 which each nucleotide unit of one polynucleotide strand can hydrogen bond with a 
nucleotide unit of a second polynucleotide strand. Less than perfect complementarity 
refers to the situation in which some, but not all, nucleotide units of two strands can 
hydrogen bond with each other. For example, for two 20-mers, if only two base pairs 
on each strand can hydrogen bond with each other, the polynucleotide strands exhibit 
30 10% complementarity. In the same example, if 18 base pairs on each strand can 
hydrogen bond with each other, the polynucleotide strands exhibit 90% 
complementarity. "Substantial complementarity" refers to polynucleotide strands 
exhibiting 79% or greater complementarity, excluding regions of the polynucleotide 
strands, such as overhangs, that are selected so as to be noncomplementary. 
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("Substantial similarity" refers to polynucleotide strands exhibiting 79% or greater 
similarity, excluding regions of the polynucleotide strands, such as overhangs, that are 
selected so as not to be similar.) Thus, for example, two polynucleotides of 29 
nucleotide units each, wherein each comprises a di-dT at the 3' terminus such that the 
5 duplex region spans 27 bases, and wherein 26 of the 27 bases of the duplex region on 
each strand are complementary, are substantially complementary since they are 96.3% 
complementary when excluding the di-dT overhangs. 



Deoxvnucleotide 

1 0 The term "deoxynucleotide" refers to a nucleotide or polynucleotide lacking a 

hydroxyl group (OH group) at the 2' and/or 3' position of a sugar moiety. Instead, it 
has a hydrogen bonded to the T and/or 3' carbon. Within an RNA molecule that 
comprises one or more deoxynucleotides, "deoxynucleotide" refers to the lack of an 
OH group at flie 2' position of the sugar moiety, having instead a hydrogen bonded 

1 5 directly to the 2 ' carbon. 



Deoxvribonuclentide 

The terms "deoxyribonucleotide" and "DNA" refer to a nucleotide or 
polynucleotide comprising at least one sugar moiety that has an H, rather than an OH, 
at its 2 ' and/or 3 'position. . ^ 

The phrasei^substantially similar" refers to a similarity of at least 90% with 
respect to the identity of the bases of the sequence. 

Duplex Region 

The phrase "duplex region" refers to the region in two complementary or 
substantially complementary polynucleotides that form base pairs with one another, 
either by Watson-Crick base pairing or any other manner that allows for a stabilized 
duplex between polynucleotide strands that are complementary or substantially 
complementary. For example, a polynucleotide strand having 21 nucleotide units can 
base pair with another polynucleotide of 21 nucleotide units, yet only 19 bases on 
each strand are complementary or substantially complementary, such that the "duplex 
region" has 19 base pairs. The remaining bases may, for example, exist as 5' and 3' 
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overhangs. Further, within the duplex region, 100% complementarity is not required; 
substantial complementarity is allowable within a duplex region. Substantial 
complementarity refers to 79% or greater complementarity. For example, a mismatch 
in a duplex region consisting of 19 base pairs results in 94.7% complementarity, 
5 rendering the duplex region substantially complementary. 

Nucleotide 

The term "nucleotide" refers to a ribonucleotide or a deoxyribonucleotide or 
modified form thereof, as well as an analog thereof. Nucleotides include species that 
10 comprise purines, e.g., adenine, hypoxanthine, guanine, and their derivatives and 
analogs, as well as pyrimidines, e.g., cytosine, uracil, thymine, and their derivatives 
and analogs: 

Nucleotide analogs include nucleotides having modifications in the chemical 
1 5 structure of the base, sugar and/or phosphate, including, but not limited to, 5-position 
pjoimidine modifications, 8-position purine modifications, modifications at cytosine 
exocychc amines, and substitution of 5-bromo-uracil; and 2'-position sugar 
modifications, including but not limited to, sugar-modified ribonucleotides in which 
the 2'-0H is replaced by a group such as an H, OR, R, halo, SH, SR, NH2, NHR, 
20 NR2, or CN, wherein R is an alkyl moiety. Nucleotide analogs are also meant to 
include nucleotides with bases such as inosine, queuosine, xanthine, sugars such as 
2 '-methyl ribose, non-natural phosphodiester linkages such as methylphosphonates, 
phosphorothioates and-peptides. 

25 Modified bases refer to nucleotide bases such as, for example, adenine, 

guanine, cytosine, thymine, uracil, xanthine, inosine, and queuosine that have been 
modified by the replacement or addition of one or more atoms or groups. Some 
examples of types of modifications that can comprise nucleotides that are modified 
with respect to the base moieties include but are not limited to, alkylated, halogenated, 

30 thiolated, aminated, amidated, or acetylated bases, individually or in combination. 
More specific examples include, for example, 5-propynyluridine, 5-propynylcytidine, 
6-methyladenine, 6-methylguanine, N,N,-dimethyladenine, 2-propyladenine, 2- 
propylguanine, 2-aminoademne, 1-methylinosine, 3-methyluridine, 5-methylcytidine, 
5-methyluridine and other nucleotides having a modification at the 5 position, 5-(2- 
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ainino)propyl uridine, 5-halocytidine, 5-halouridine, 4-acetylcytidine, 1- 
methyladenosine, 2-methyladenosine, 3-methylcytidine, 6-methyluridme, 2- 
methylguanosine, 7-methylguanosine, 2,2-dimethylguanosine, 5- 
methylaminoethyluridine, 5-methyloxyuridine, deazanucleotides such as 7-deaza- 
5 adenosine, 6-azouridine, 6-azocytidine, 6-azothymidine, 5-methyl-2-thiouridine, other 
thio bases such as 2-thiouridine and 4-thiouridine and 2-thiocytidine, dihydrouridine, 
pseudouridine, queuosine, archaeosine, naphthyl and substituted naphthyl groups, any 
O- and N-alkylated purines and pyrimidines such as N6-methyladenosme, 5- 
methylcarbonylmethyluridine, uridine 5-oxyacetic acid, pyridine-4-one, pyridme-2- 

1 0 one, phenyl and modified phenyl groups such as aminophenol or 2,4,6-trimethoxy 
benzene, modified cytosines that act as G-clamp nucleotides, 8-substituted adenines 
and guanines, 5-substituted uracils and thymines, azapyrimidines, 
caiboxyhydroxyalkyl nucleotides, carboxyalkylaminoalkyl nucleotides, and 
alkylcarbonylalkylated nucleotides. Modified nucleotides also include those 

1 5 nucleotides that are modified with respect to the sugar moiety, as well as nucleotides 
having sugars or analogs thereof that are not ribosyl. For example, the sugar moieties 
may be, or be based on, mannoses, arabinoses, glucopyranoses, galactopyranoses, 4'- 
thioribose, and other sugars, heterocycles, or carbocycles. 

20 The term nucleotide is also meant to include what are known in the art as 

universal bases. By way of example, universal bases include but are not limited to 3- 
nitropyrrole, 5-nitroindole, or nebularine. The term "nucleotide" is also meant to 
include the N3' to PSSphosphor-amidate, resulting firom the substitution of a ribosyl 3' 
oxygen with an amine group. 

25 

Further, the term nucleotide also includes those species that have a detectable 
label, such as for example a radioactive or fluorescent moiety, or mass label attached 
to the nucleotide. 

30 Polynucleotide 

The term "polynucleotide" refers to polymers of nucleotides, and includes but 
is not limited to DNA, RNA, DNA/RNA hybrids including polynucleotide chains of 
regularly and/or irregularly altemating deoxyribosyi moieties and ribosyl moieties 
(i.e., wherein alternate nucleotide units have an -OH, then and -H, then an -OH, then 
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an -H, and so on at the 2' position of a sugar moiety), and modifications of these 
kinds of polynucleotides, wherein the attachment of various entities or moieties to the 
nucleotide units at any position are included. 

) Pol\Tibonucleotide 

The term "polyribonucleotide" refers to a polynucleotide comprising two or 
more modified or unmodified ribonucleotides and/or their analogs. The term 
"polyribonucleotide" is used interchangeably with the term "oligoribonucleotide." 

3 Ribonucleotide and ribonucleic acid 

The term "ribonucleotide" and the phrase "ribonucleic acid" (RNA), refer to a 
modified or unmodified nucleotide or polynucleotide comprising at least one 
ribonucleotide unit. A ribonucleotide unit comprises an hydroxyl group attached to 
the 2' position of a ribosyl moiety that has a nitrogenous base attached in N- 

5 glycosidic linkage at the 1 ' position of a ribosyl moiety, and a moiety that either 
allows for linkage to another nucleotide or precludes linkage. 

Detailed Description of the Invention 

0 The present invention is directed to improving the eflSciency of gene silencing 

by siRNA. Through the inclusion of multiple siRNA sequences that are targeted to a 
particular gene and/or selecting an siRNA sequence based on certain defined criteria, 
improved efficiency may bea d^eyed. 

5 The present invention will now be described in connection with preferred 

embodiments. These embodiments are presented in order to aid in an understanding 
of the present invention and are not intended, and should not be construed, to limit the 
invention in any way. All alternatives, modifications and equivalents that may 
become apparent to those of ordinary skill upon reading this disclosure are included 

0 within the spirit and scope of the present invention. 



Furthermore, this disclosure is not a primer on RNA interference. Basic 
concepts known to persons skilled in the art have not been set forth in detail. 
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Optimizing siRNA 

According to one embodiment, the present invention provides a method for 
improving the effectiveness of gene silencing for use to silence a particular gene 
through the selection of an optimal siRNA. An siRNA selected according to this 
5 method may be used individually, or in conjunction with the first embodiment, i.e., 
with one or more other siRNAs, each of which may or may not be selected by this 
criteria in order to maximize their efficiency. 



The degree to which it is possible to select an siRNA for a given mRNA that 
maximizes these criteria will depend on the sequence of the mRNA itself. However, 
the selection criteria will be independent of the target sequence. According to this 
method, an siRNA is selected for a given gene by using a rational design. That said, 
rational design can be described in a variety of ways. Rational design is, in simplest 
terms, the application of a proven set of criteria that enhance the probability of 
identifying a functional or hyperfiinctional siRNA. In one method, rationally 
designed siRNA can be identified by maximizing one or more of the following 
criteria: 

1 . A low GC content, preferably between about 30 -52%. 

2. At least 2, preferably at least 3 A or U bases at positions 1 5- 1 9 of the 
siRNA on the sense strand. 

3. An A base at position 1 9 of the sense strand. 

4. An A base at positiqibBsof the sense strand. 

5. A U base at position 1 0 of the sense strand. 

6. An A base at position 14 of the sense strand. 

7. A base other than C at position 19 of the sense strand. 

8. A base other than G at position 1 3 of the sense strand. 

9. A Tm, which refers to the character of the internal repeat that results in 
inter- or intramolecular structures for one strand of the duplex, that is 
preferably not stable at greater than 50°C, more preferably not stable at 
greater than 37°C, even more preferably not stable at greater than 30°C 
and most preferably not stable at greater than 20°C. 

10. A base other than U at position 5 of the sense strand. 

11. A base other than A at position 1 1 of the sense strand. 
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Criteria 5, 6, 10 and 1 1 are minor criteria, but are nonetheless desirable. 
Accordingly, preferably an siRNA will satisfy as many of the aforementioned criteria 
as possible, more preferably at least 1 - 4 and 7-9, and most preferably all of the 
5 criteria 

With respect to the criteria, GC content, as well as a high number of AU in 
positions 15-19, may be important for easement of the unwinding of double stranded 
siRNA duplex. Duplex unwinding has been shown to be crucial for siRNA 
1 0 functionality in vivo. 

With respect to criteridn 9, the internal structure is measured in terms of the 
melting temperature of the single strand of siRNA, which is the temperature at which 
50% of the molecules will become denatured. With respect to criteria 2-8 and 10 - 
15 1 1 , the positions refer to sequence positions on the sense strand, which is the strand 
that is identical to the mRNA. 

In one preferred embodiment, at least criteria 1 and 8 are satisfied. In another 
preferred embodiment, at least criteria 7 and 8 are satisfied. In still another preferred 
20.. '• embodiment, at least criteria 1, 8 and 9 are satisfied. 

It should be noted that all of the aforementioned criteria regarding sequence 
position specifics are with respect toith^S^tend of the sense strand. Reference is 
made to the sense strand, because most databases contain information that describes 
25 the information of the mRNA. Because according to the present invention a chain can 
be from 1 8 to 30 bases in length, and the aforementioned criteria assumes a chain 19 
base pairs in length, it is important to keep the aforementioned criteria apphcable to 
the correct bases. 

30 When there are only 1 8 bases, the base pair that is not present is the base pair 

that is located at the 3' of the sense strand. When there are twenty to thirty bases 
present, then additional bases are added at the 5' end of the sense chain and occupy 
positions ~1 to "1 1 . Accordingly, with respect to SEQ. ID NO. 0001 . 
NNANANNNNUCNAA'NNNNA and SEQ. ID NO. 0028. 
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GUCNNANANKNNUCNAANNNNA, both would have A at position 3, A at 
position 5, U at position 10, C at position 1 1, A and position 13, A and position 14 
and A at position 19. However, SEQ. ID NO. 0028 would also have C at position -1 , 
U at position -2 and G at position -3. 

5 

For a 1 9 base pair siRNA, an optimal sequence of one of the strands may be 
represented below, where N is any base, A, C, G, or U: 

SEQ. ID NO. 0001. NNANANNNNUCNAANNNNA 
10 SEQ. ID NO. 0002. NNANANNNNUCNAANNNNA 

SEQ. IDNO. 0003. NNANANNNNUUNAANNNNA 

SEQ. ID NO. 0004. NNANA]SH>]NNUCNCANNNNA 

SEQ. ID NO. 0005. NNANANNNNUGNCANNNNA 

SEQ. ID NO. 0006. NNANANNNNUUNCANNNNA 
15 SEQ. ID NO. 0007. NNANANNNNUCNUANNNNA 

SEQ. ID NO. 0008.. NNANANNNNUGNUANNNNA 

SEQ. ID NO. 0009. NNANANNNNUUNUANNNNA 

SEQ. ID NO. 0010. NNANCNNNNUCNAANNNNA 

SEQ. ID NO. 001 1 . NNANCNNNNUGNAANNNNA 
20 ..SEQ. ID NO. 0012. NNANCmiNNUUNAANNNNA 

SEQ. ID NO. 0013. NNANCNNNNUCNCANNNNA 

SEQ. ID NO. 0014. NNANCNNNNUCNCANNNNA 

SEQ. ID NO. 0015. NNANCNNNNUtiiifiA^BUNNA 

SEQ. ID NO. 0016. NANCNNNNUCNUANNNNA 
25 SEQ. ID NO. 0017. NNANCNNNNUCNUANNNNA 

SEQ. ID NO. 0018. NNANCNNNNtnJNUANNNNA 

SEQ. ID NO. 001 9. NNANCNNNNUCNAANNNNA 

SEQ. ID NO. 0020. NNANGNNNNUGNAANNNNA 

SEQ. ID NO. 0021 . NNANGNNNNUUNAANNNNA 
30 SEQ. ID NO. 0022. NNANGNNNNUCNCANNNNA 

SEQ. ID NO. 0023. NNANGNNNNUCNCANNNNA 

SEQ. ID NO. 0024. NNANGNNNNUUNCANNNNA 

SEQ. ID NO. 0025. NNANCNNNNUCNUANNNNA 

SEQ. ID NO. 0026. NNANCNNNNUCNUANNNNA 
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SEQ. JD NO. 0027. NNANGNNNNNUNUANNNNA 

In one embodiment, the sequence used as an siRNA is selected by choosing 
the siRNA that score highest according to one of the following seven algorithms that 
5 are represented by Fonnulas I - VII: 



Relative functionality of siRNA= -{003) +(AU,5-,9) -<Tm2ooc)*3 -(Gi3)*3 -(€,9) 
+(A,9)*2 +(A3) +(U,o)+(A,4) -(Us) -(An) 

10 



Relative functionality of siRNA= ^(GC/3) -(AU,5.i9)*3 -{Gn)*3 -(€,9) +(A,9)*2 

+(A3) 



15 Formula III 

Relative functionality of siRNA= -(GC/3) +(AU,5.i9) -(Tm20'c)*3 

Formula IV 

Relative functionality of siRNA= 
20 -GC/J+( AU,5.,9)/2-(Tm2o-c)*2 -(G,3)*3 -(€,9) *(A,9)*2 h-(A3) +(U.oWA.4) -(U5) - 
(An) 



Relative functionality of siRNA=.(G,3)*3 -(Q9) +(A,9)*2 ^A,) + (U,o)+(Ah) -(U5) 
25 -(An) 



Formula VI 

Relative fimctionaUty of siRNA=-(G,3)*3 -(0,9) +(A,9)*2 +(A3) 
30 FonnulaVn 

Relative functionality of siRNA=-(GC/2) +(AU,5-,9)/2 -( Tm^o-c)*! -(G,3)*3 -(€,9) 
+(A,9)*3 +(A3)*3 +(U,o)/2+(A,4)/2 -(Us)/2 -(A„)/2 



In Formulas I - VII: 
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wherein A19 = 1 if A is the base at position 19 on the sense strand, otherwise its value 

is 0, 

AU15.19 = 0-5 depending on the number of A or U bases on the sense strand 

at 

5 positions 15 -19; 

Gi3= 1 if G is the base at position 13 on the sense strand, otherwise its value is 

0; 

C]9 = 1 if C is the base at position 19 of the sense strand, otherwise its value is 

0; 

10 GC= the number of G and C bases in the entire sense strand; 

Tm 20°c=l if the Tm is greater than 20°C; 

A3= 1 if A is the base at position' s on the sense strand, otherwise its value is 0; 
Uior= 1 if U is the base at position 10 on the sense strand, otherwise its value is 

0; 

15 A 14 = 1 if A is the base at position 14 on the sense strand, otherwise its value 

is 0; 

U5 = 1 if U is the base at position 5 on the sense strand, otherwise its value is 

0; and 

Ai 1 = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 
20 0. . ' 



Formulas I -VII provide relative information regarding functionality. When 
the values for two sequences are compared for^ ,given>formula, the relative 
fimctionality is ascertained; a higher positive number indicates a greater functionality. 
25 For example, in many applications a value of 5 or greater is beneficial. 



Additionally, in many applications, more than one of these formulas would 
provide useful information as to the relative functionality of potential siRNA 
sequences. However, it is beneficial to have more than one type of formula, because 
30 not every formula will be able to help to differentiate among potential siRNA 

sequences. For example, in particularly high GC mRNAs, formulas that take that 
parameter into account would not be useful and application of formulas that lack GC 
elements {e.g., formulas V and VI) might provide greater insights into duplex 
functionality. Similarly, formula II might by used in situations where hairpin 
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Structures are not observed in duplexes, and formula IV might be applicable for 
sequences that have higher AU content. Thus, one may consider a particular sequence 
in light of more than one or even all of these algorithms to obtain the best 
differentiation among sequences. In some instances, application of a given algorithim 
5 may identify an unususally large number of potential siRNA sequences, and in those 
cases, it may be appropriate to re-analyze that sequence with a second algorithm that 
is, for instance, more stringent. Alternatively, it is conceivable that analysis of a 
sequence with a given formula yields no acceptable siRNA sequences {i.e. low 
SMARTscoresTM). hi this instance, it may be appropriate to re-analyze that sequences 

1 0 with a second algorithm that is, for instance, less stringent, hi still other instances, 
analysis of a single sequence with two separate formulas may give rise to conflicting 
results (i.e. one formula generates a set of siRNA with high SMARTscores™ while 
the other formula identifies a set of siRNA with low SMARTscores™). In these 
instances, it may be necessary to determine which weighted factor(s) (e.g. GC 

1 5 content) are contributing to the discrepancy and assessing the sequence to decide 
whether these factors should or should not be mcluded. Alternatively, the sequence 
could be analyzed by a third, fourth, or fifth algorithm to identify a set of rationally 
designed siRNA. 

20 The above-referenced criteria are particularly advantageous when used in 

combination with pooling techniques as depicted in Table I: 



Table I 



Criteria 


Functional Probabihty 




Oligos 


Pools 




>95% 


>80% 


<70% 


>95% 


>80% 


<70% 


Current 


33.0 


50.0 


23.0 


79.5 


97.3 


0.3 


New 


50.0 


88.5 


8.0 


93.8 


99.98 


0.005 


(GC) 


28.0 


58.9 


36.0 


72.8 


97.1 


1.6 



The term "current" refers to Tuschl's conventional siKNA parameters (Elbashir, S.M. 
et al. (2002) "Analysis of gene function in somatic mammalian cells using small 
interfering RNAs" Methods 26: 199-213). "New" refers to the design parameters 
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described in Formulas I-VII. "GC" refers to criteria that select siRNA solely on the 
basis of GC content. 



As Table I indicates, when more functional siRNA duplexes are chosen, 
5 siRNAs that produce <70% silencing drops fi-om 23% to 8% and the number of 
siRNA duplexes that produce >80% silencmg rises from 50% to 88.5%. Further, of 
the siRNA duplexes with >80% silencing, a larger portion of these siRNAs actually 
silence >95% of the target expression (the new criteria increases the portion irom 
33% to 50%). Using this new criteria ui pooled siRNAs, shows that, with pooling, the 
1 0 amount of silencing >95% increases from 79.5% to 93 .8% and essentially eliminates 
any siRNA pool from silencing less than 70%. 



Table II similarly shows the particularly beneficial results of pooling in 
combination with the aforementioned criteria However, Table II, which takes into 
1 5 account each of the aforementioned variables, demonstrates even a greater degree of 
improvement m fiinctionality. 



Table n 





Functional Probability 


Oligos 


Pools 


Functional 


Average 


Non- 
fiinctional 


Functional 


Average 


Non- 
functional 


Random 


20 


40 


50 


67 


97 


3 


Criteria 1 


52 


99 


0.1 


97-^--a-^ 


-93 


0.0040 


Criteria 4 


89 


99 


0.1 


99 


99 


0.0000 



The terms "functional," "Average," and "Non- functional" refer to siRNA that exhibit 
20 >80%, >50%, and <50% functionality, respectively. Criteria 1 and 4 refer to specific 
criteria described above. 



25 



The above-described algorithms may be used with or without a computer 
program that allows for the mputting of the sequence of the mRNA and automatically 
outputs the optimal siRNA. The computer program may, for example, be accessible 
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from a local tenninal or personal computer, over an internal network or over the 
Internet. 

In addition to the formulas above, more detailed algorithms may be used for 
5 selecting siRNA. Preferably, at least one RNA duplex of between 1 8 and 30 base 
pairs is selected such that it is optimized according a formula selected from: 



Formula VIII: (-14)*G,3-13*Ai-12*U7-ll*U2-10*A„-10*U4-10*C3-10*Cs-10*C6- 
1 0 9*Aio-9*U9-9*C,8-8*Gio-7*U,-7*U,6-7*Ci7-7*C,9 

+7*Ui7+8*A2+8*A4 +8*Arl-8*C4 +9*G8 +10*A7+1 0*Ui8+l 1 *Ai9+ 
11*C9+15*G,+ 18*A3+19*U,o-Tm^3* (GCtotoi)-6*(GC,5-,9)- 
30*X;and 

15 Formula DC: (14.1)*A3+(14.9)*A6+(17.6)*Ai3+(24.7)*Ai9+(14.2)*U,o+(10.5)* 
C9+(23.9)*Gi+(16.3)*G2+(-12.3)*A,i+(-19.3)*Ui+(-12.1)*U2+ 
(-ll)*U3+(-15.2)*Ui5+(-11.3)nj,6+(-11.8)*C3+(-17.4)*Cfff(- 
10.5) *€,+ (-13.7)*Gi3+(-25.9)*Gi9-Tm-3* (GQotai)- 6*(GCi5-i9)- 
30*X 



wherein 

Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 
A2 = 1 if A is the base at position 2 of the sense strand, otherwise. its value is 0; 
A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 
A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
As = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
Ae = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 
A7 = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 
Aio= 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 
All = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 
Ai3 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
Ai9 = 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 
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C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 
C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
Ce = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 
5 C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 
C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
C18 = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 
Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise if another base is 
1 0 present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
G2= 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gs = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
15 Gio = 1 if G is the base at position 1 0 on the sense strand, otherwise its value is 0; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
Gi9= 1 if G is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

20 Ui = 1 if U is the base -at position 1 on the sense strand, otherwise its value is 0; 

U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 

U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 

U4 = 1 if U is the base at position 4 on the sense strand, otheisase its -value is 0; 

U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
25 U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 

Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 

Uis = 1 if U is the base at position 1 5 on the sense strand, otherwise its value is 0; 

U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 

Ui7 = 1 if U is the base at position 1 7 on the sense strand, otherwise its value is 0; 
30 U18 = 1 if U is the base at position 1 8 on the sense strand, otherwise its value is 0; 



GC15-19 = the number of G and C bases within positions 15-19 of the sense 

strand, or within positions 15 -18 if the sense strand is only 18 base pairs in 
length; 
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GCtotai = the mimber of G and C bases in the sense strand; 

Tm = 100 if the siRNA oligo has the internal repeat longer then 4 base paks, 

otherwise its value is 0; and 
X = the number of times that the same nucleotide repeats four or more times in a 
5 row. 

The above formulas VIII and IX, as well as formulas I - VII, provide methods 
for selecting siRNA in order to increase the efficiency of gene silencing. A subset of 
variables of any of the formulas may be used, though when fewer variables are used, 
1 0 the optimization hierarchy becomes less reliable. 

With respect to the variables of the above-referenced formulas, a single letter 
of A or C or G or U followed by a subscript refers to a binary condition. The binary 
condition is that either the particular base is present at that particular position 

1 5 (wherein the value is "1 ") or the base is not present (wherein the value is "0"). 

Because position 19 is optional, i.e. there might be only 1 8 base pairs, when there are 
only 1 8 base pairs, any base with a subscaipt of 1 9 in the formulas above would have 
a zero value for that parameter. Before or after each variable is a number followed by 
*, which indicates that the value of the variable is to be multiplied or weighed by that 

20 number. 

The numbers preceding the variables A, or G, or C, or U in Formulas VIII and 
IX (or after the variables in Formula I - VII) were determined by^eomrpaidag the 
difference in the frequency of individual bases at different positions in functional 

25 siRNA and total siRNA. Specifically, the frequency in which a given base was 
observed at a particular position in fiinctional groups was compared with the 
frequency that that same base was observed in the total, randomly selected siRNA set. 
If the absolute value of the difference between the fimctional and total values was 
found to be greater than 6%, that parameter was included in the equation. Thus for 

30 instance, if the frequency of finding a "G" at position 1 3 (Gn) is found to be 6% in a 
given functional group, and the frequency of G13 in the total population of siRNAs is 
20%, the difference between the two values is 6%-20% = -14%. As the absolute value 
is greater than six (6), this factor (-14) is included in the equation. Thus in Formula 
VIII, in cases where the siRNA under study has a G in position 13, the accrued value 
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is (-14) * (1) = -14. In contrast, when a base other than G is foxind at position 13, the 
accrued value is (-14) * (0) = 0. 



When developing a means to optimize siRNAs, the inventors observed that a 
5 bias toward low internal thermodynamic stability of the duplex at the 5'-antisense 
(AS) end is characteristic of naturally occurring miRNA precursors. The inventors 
extended this observation to siRNAs for which functionality had been assessed in 
tissue culture. 



1 0 With respect to the parameter GC15-19, a value of 0 - 5 will be ascribed 

depending on the number of G or C bases at positions 15 to 19. If there are only 18 
base pairs, the value is between 0 and 4. ■ 



With respect to the criterion GQotai content, a number &om 0-30 will be 
1 5 ascribed, which correlates to the total number of G and C nucleotides on the sense 
strand, excluding overhangs. Without wishing to be bound by any one theory, it is 
poshilated that the significance of the GC content (as well as AU content at positions 
15-19, which is a parameter for formulas III - VII) relates to the easement of the 
unwinding of a double-stranded siRNA duplex. Duplex imwinding is believed to be 
20 crucial for siRNA functionality in vivo and overall low intemal stability, especially 
low intemal stability of the first unwound base pair is beHeved to be important to 
maintain sufficient processivity of RISC complex-induced duplex unwinding. If the 
duplex has 19 base pairs, those at positions 1 5-19 on the sense strandiwilkmwind first 
if the molecule exhibits a sufficiently low intemal stability at that position. As 
25 persons skilled in the art are aware, RISC is a complex of approximately twelve 
proteins; Dicer is one, but not the only, hehcase within this complex. Accordingly, 
although the GC parameters are believed to relate to activity with Dicer, they are also 
important for activity with other RISC proteins. 

30 The value of the parameter Tm is 0 when there are no intemal repeats longer 

than (or equal to) four base pairs present in the siRNA duplex; otherwise the value is 
1 . Thus for example, if the sequence ACGUACGU, or any other four nucleotide (or 
more) palindrome exists within the stracture, the value will be one (1). Alternatively 
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if the structure ACGGACG, or any other 3 nucleotide (or less) palindrome exists, the 
value will be zero (0). 

The variable "X" refers to the number of times that the same nucleotide occurs 
5 contiguously in a stretch of four or more units. If there are, for example, four 
contiguous As in one part of the sequence and elsewhere in the sequence four 
contiguous Cs, X =2. Further, if there are two separate contiguous stretches of four of 
the same nucleotides or eight or more of the same nucleotides in a row, then X=2. 
However, X does not increase for five, six or seven contiguous nucleotides. 

10 

Again, when applying Fonnula VIII or Formula DC to a given mRNA, (the 
"target RNA" or "target molecule"), one may use a computer program to evaluate the 
criteria for every sequence of 1 8 - 30 base pairs or only sequences of a fixed length, 
e.g., 19 base pairs. Preferably the computer program is designed such that it provides 

1 5 a report ranking of all of fhs potential siRNAs between 1 8 and 30 base pairs, ranked 
according to which sequences generate the highest value. A higher value refers to a 
more efficient siRNA for a particular target gene. The computer program that may be 
used, may be developed in any computer language that is known to be useftil for 
scoring nucleotide sequences, or it may be developed with the assistance of 

20 conamercially available. product such as Microsoft's product .net. Additionally, rather 
than run every sequence through one and/or another formula, one may compare a 
subset of the sequences, which may be desirable if for example only a subset are 
available. For instance, it may be desirable to first perform a BLAST (BasiGrJbQcal. . 
Aligmnent Search Tool) search and to identify sequences that have no homology to 

25 ottiar targets. Alternatively, it may be desirable to scan the sequence and to identify 
regions of moderate GC context, then perform relevant calculations using one of the 
above-described formulas on these regions. These calculations can be done manually 
or with the aid of a computer. 

30 As with Formulas I - VII, either Formula VIII or Formula EX may be used for 

a given mRNA target sequence. However, it is possible that according to one or the 
other formula more than one siRNA will have the same value. Accordingly, it is 
beneficial to have a second formula by which to differentiate sequences. Fonnula IX 
was derived in a similar fashion as Formula VIII, yet used a larger data set and thus 
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yields sequences with higher statistical correlations to highly fimctional diq)lexes. The 
sequence that has the highest value ascribed to it may be referred to as a "first 
optimized duplex." The sequence that has the second highest value ascribed to it may 
be referred to as a "second optimized duplex." Similarly, the sequences that have the 
5 third and fourth highest values ascribed to them may be referred to as a third 

optimized duplex and a fourth optimized duplex, respectively. When more than one 
sequence has the same value, each of them may, for example, be referred to as first 
optimized duplex sequences or co-first optimized duplexes. 

1 0 SiRN A sequences identified using Formula VIII are contained within the 

enclosed compact disks. The data included on the enclosed compact disks is 
described more fiilly below. The sequences identified by Formula VIII that are 
disclosed in the compacts disks maybe used in gene silencing applications. 

15 It should be noted that for Formulas VIII and DC all of the aforementioned 

criteria are identified as positions on the sense strand when oriented in the 5' to 3' 
direction as they are identified in connection with Formulas I - VII unless otherwise 
specified. 

Formulas I - DC, may be usedJo-select or to evaluate one, or more flian one, 
siRNA in order to optimize silencing. Preferably, at least two optimized siRNAs that 
have been selected according to at least one of these formulas are used to silence a 
gene, more preferably at least three and most preferably at least four. The siRNAs;...-:^ 
may be used individually or together in a pool or kit. Further, they may be applied to 
a cell simultaneously or separately. Preferably, the at least two siRNAs are applied 
simultaneously. Pools are particularly beneficial for many research appUcations. 
However, for therapeutics, it may be more desirable to employ a single 
hyperfunctional siRNA as described elsewhere in this application. 

30 When planning to conduct gene silencing, and it is necessary to choose 

between two or more siRNAs, one should do so by comparing the relative values 
when the siRNA are subjected to one of the formulas above. In general a higher 
scored siRNA should be used. 
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Useful applications include, but are not limited to, target validation, gene 
functional analysis, research and drug discovery, gene therapy and therapeutics. 
Methods for using siRNA in these applications are well known to persons of skill in 
the art. 

5 

Because the ability of siRNA to function is dependent on the sequence of the 
RNA and not the species into which it is introduced, the present invention is 
applicable across a broad range of species, including but not lunited to all mammalian 
species, such as humans, dogs, horses, cats, cows, mice, hamsters, chimpanzees and 
10 gorillas, as well as other species and organisms such as bacteria, viruses, insects, 
plants and C. elegans. 

The present invention is also applicable for use for silencing a broad range of 
genes, including but not limited to the roughly 45,000 genes of a human genome, and 
1 5 has particular relevance in cases where those genes are associated with diseases such 
as diabetes, Alzheimer's, cancer, as well as all genes in the genomes of the 
aforementioned organisms. 

The siRNA selected according to the aforementioned criteria or one of the 
20 aforementioned algorithms are also, for example, useM in the simultaneous screening 
and functional analysis of multiple genes and gene families using high throughput 
strategies, as well as in direct gene suppression or silencing. 

Development of the Algorithms 

25 To identify siRNA sequence features that promote functionality and to 

quantify the importance of certain currently accepted conventional factors — such as 
G/C content and target site accessibility — ^the inventors synthesized an siRNA panel 
consisting of 270 siRNAs targeting three genes, Human Cyclophilin, Firefly 
Luciferase, and Human DBI. In all three cases, siRNAs were directed against specific 

30 regions of each gene. For Human Cyclophilin and Firefly Luciferase, ninety siRNAs 
were directed against a 199 bp se^ent of each respective mRNA. For DBI, 90 
siRNAs were directed against a smaller, 109 base pair region of the mRNA. The 
sequences to which the siRNAs were dhected are provided below. 
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It should be noted that in certain sequences, "t" is present. This is because 
many databases contain information in this manner. However, the t denotes a uracil 
residue in mRNA and siRNA. Any algorithm will, unless otherwise specified, 

process a t in a sequence as a u. 

5 

Human cvclophilin: 193 — 390. M60857 
SEQ. ID NO. 29: 

gttccaaaaacagtggataattttgtggccttagctacaggagagaaaggatttggctacaaaaacagcaaattccatcgtgt 
aatcaaggacttcatgatccagggcggagacttcaccaggggagatggcacaggaggaaagagcatctacggtgagcg 
1 0 cttccccgatgagaacttcaaactgaagcactacgggcctggctggg 

Firefly luciferase: 1434—1631. U47298 TpGLS. Promega 't 
SEQ. ID NO. 30: 

tgaacttcccgccgccgttgttgttttggagcacggaaagacgatgacggaaaaagagatcgtggattacgtcgccagtca 
15 agtaacaaccgcgaaaaagttgcgcggaggagttgtgtttgtggacgaagtaccgaaaggtcttaccggaaaactcgacg 
caagaaaaatcagagagatcctcataaaggccaagaagg 

DBl. NM 020548 < 202-310 feverv position'^ 
SEQ. ID NO. 0031: 

20 acgggcaaggccaagtgggatgcctggaatgagctgaaaggg^cttccaaggaagatgccatgaaagcttacatcaaca 
aagtagaagagctaaagaaaaaatacggg 

A list of the siRNAs appears in Table III (see Examples Section, Example II) 

25 The set of duplexes was analyzed to identify correlations between siRNA 

ftmctionality and other biophysical or thermodynamic properties. When the siRNA 
panel was analyzed in functional and non-functional subgroups, certain nucleotides 
were much more abundant at certain positions in functional or non-fiinctional groups. 
More specifically, ttie firequency of each nucleotide at each position in highly 

30 fimctional siRNA duplexes was compared with that of nonfunctional duplexes in 
order to assess the preference for or against any given nucleotide at every position. 
These analyses were used to determine important criteria to be included in the siRNA 
algorithms (Formulas VIII and DC). 
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The data set was also analyzed for distinguishing biophysical properties of 
siRNAs in the fimctional group, such as optimal percent of GC content, propensity for 
internal structures and regional thermodynamic stability. Of the presented criteria, 
several are involved in duplex recognition, RISC activation/duplex unwinding, and 
5 target cleavage catalysis. 

The original data set that was the source of the statistically derived criteria is 
shown in Figure 2. Additionally, this figure shows that random selection yields 
siRNA duplexes with unpredictable and widely varying silencing potencies as 

1 0 measured in tissue culture using HEK293 cells. In the figure, duplexes are plotted 
such that each x-axis tick-mark represents an individual siRNA, with each subsequent 
siRNA differing in target position by two nucleotides for Human Cyclophilin and 
Firefly Luciferase, and by one nucleotide for Human DBI. Furthermore, the y-axis 
denotes the level of target expression remaining after transfection of the duplex into 

1 5 cells and subsequent silencing of the target. 

SiRNA identified and optimized in this document work equally well in a wide 
range of cell types. Figure 3a shows the evaluation of thirty siRNAs targeting the 
DBI gene in three cell lines derived from different tissues. Each DBI siRNA displays 

20 very similar functionality in HEK293 (ATCC, CR3>1 573, human embryonic kidney), 
HeLa (ATCC, CCL-2, cervical epithelial adenocarcinoma) and DU145 (HTB-81, 
prostate) cells as deterimined by the B-DNA assay. Thus, siRNA fijnctionality is 
• determined by the primary sequence of the siRNA and not by the intracellular 
environment. Additionally, it should be noted that although the present invention 

25 provides for a determination of the fimctionality of siRNA for a given target, the same 
siRNA may silence more than one gene. For example, the complementary sequence 
of the silencing siRNA may be present in more than one gene. Accordingly, in these 
circumstances, it may be desirable not to use the siRNA with highest SMARTscore™. 
In such circumstances, it may be desirable to use the siRNA with the next highest 

30 SMARTscore™. 

To determine the relevance of G/C content in siRNA function, the G/C content 
of each duplex in the panel was calculated and the fimctional classes of siRNAs 
(<F50, > F50, > F80, > F95 where F refers to the percent gene silencing) were sorted 
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accordingly. The majority of the highly-functional siRNAs (>F95) fell within the 
G/C content range of 36%— 52% (Figure 3B). Twice as many non-ftmctional (< 
F50) duplexes fell within the high G/C content groups (>57% GC content) compared 
to the 36%— 52% group. The group with extremely low GC content (26% or less) 

5 contained a higher proportion of non-functional siRNAs and no highly-functional 
siRNAs. The G/C content range of 30%— 52% was therefore selected as Criterion I 
for siRNA functionality, consistent with the observation that a G/C range 30% — 70% 
promotes efficient RNAi targeting. Application of this criterion alone provided only a 
marginal increase in the probability of selecting functional siRNAs from the panel: 

10 selection of F50 and F95 siRNAs was improved by 3.6% and 2.2%, respectively. The 
siRNA panel presented here permitted a more systematic analysis and quantification 
of the importance of this criterion than that used previously. 

A relative measure of local internal stability is the A/U base pair (bp) content; 

1 5 therefore, the firequency of A/U bp was determined for each of the five terminal 
positions of the dvplex (5' sense (S)/5' antisense (AS)) of all siRNAs in the panel. 
Duplexes were then categorized by the nvimber of A/U bp in positions 1 — 5 and 15 — 
19 of the sense strand. The thermodynamic flexibility of the duplex 5 '-end (positions 
1_5; s) did not appear to correlate appreciably with silencing potency, while that of 

20 the 3'-end (positions 15 — 19; S) correlated with effici^t silencing. No duplexes 
lacking A/U bp in positions 1 5 — 19 were fvmctional. The presence of one A/U bp in 
this region conferred some degree of fiinctionality, but the presence of three or more 
A/Us:swas preferable and therefore defined as Criterion II. When applied to the test 
panel, only a marginal increase in the probability of functional siRNA selection was 

25 achieved: a 1 .8% and 2.3% increase for F50 and F95 duplexes, respectively (Table 
IV). 

The complanentary strands of siRNAs that contain internal repeats or 
palindromes may form internal fold-back structures. These hairpin-like structures 
30 exist in equihbrium with the duplexed form effectively reducing the concentration of 
functional duplexes. The propensity to form internal hairpins and their relative 
stability can be estimated by predicted melting temperatures. High Tm reflects a 
tendency to form hairpin structures. Lower Tm values indicate a lesser tendency to 
form hairpins. When the fimctional classes of siRNAs were sorted by T„ (Figure 3c), 
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the following trends were identified: duplexes lacking stable internal repeats were the 
most potent silencers (no F95 duplex with predicted hairpin structure T„ > 60 °C). hi 
contrast, about 60% of the duplexes in the groups having internal hairpins with 
calculated T,„ values less than 20 °C were F80. Thus, the stability of internal repeats 
5 is inversely proportional to the silencing effect and defines Criterion III (predicted 
hairpin structure Tm < 20 °C). 



Sequence-based determinants of siRNA functionality 

When the siRNA panel was sorted into fiinctional and non-fiinctional groups, 

1 0 the frequency of a specific nucleotide at each position in a fijnctional siRNA duplex 
was compared with that of a nonfimctional duplex in order to assess the preference for 
or against a certain nucleotide. Figure 4 shows the results of these queries and the 
subsequent resorting of the data set (from Figure 2). The data is separated into two 
sets: those duplexes that meet the criteria, a specific nucleotide in a certain position - 

1 5 grouped on the left (Selected) and those that do not - grouped on the right 

(Eliminated). The duplexes are fijrther sorted from most fimctional to least fimctional 
with the y-axis of Figure 4a-e representing tiie % expression i.e. the amount of 
silencing that is ehcited by the diiplex (Note: each position on the X-axis represents a 
different diq)lex). Statistical analysis revealed correlations between silencing and 

20 several sequence-related jproperties of siRNAs. Figure.4-and Table IV show 

quantitative analysis for the following five sequence-related properties of siRNA: (A) 
an A at position 19 of the sense strand; (B) an A at position 3 of the sense strand; (C) 
a U at position 1 0 of the sense strand; (D) a base other than G at position 1 3 of the 
sense strand; and (E) a base other than C at position 19 of the sense sbi^nd. 

25 

When the siRNAs in the panel were evaluated for the presence of an A at 
position 19 of the sense strand, the percentage of non-fiinctional duplexes decreased 
from 20% to 11. 8%, and the percentage of F95 duplexes increased from 21 .7% to 
29.4% (Table IV). Thus, the presence of an A in this position defined Criterion IV. 

30 

Another sequence-related property correlated with silencing was the presence 
of an A in position 3 of tiie sense sfrand (Figure 4b). Of the siRNAs with A3, 34.4% 
were F95, compared with 21 .7% randomly selected siRNAs. The presence of a U 
base in position 10 of the sense strand exhibited an even greater impact (Figure 4c). 
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Of the duplexes in this group, 41 .7% were F95. These properties became criteria V 
and VI, respectively. 

Two negative sequence-related criteria that were identified also appear on 
5 Figure 4. The absence of a G at position 1 3 of the sense strand, conferred a marginal 
increase in selecting functional duplexes (Figure 4d). Similarly, lack of a C at 
position 19 of the sense strand also correlated with fiinctionality (Figure 4e). Thus, 
among functional duplexes, position 19 was most likely occupied by A, and rarely 
occupied by C. These rules were defined as caiteria VII and VIII, respectively. 

D 

Application of each criterion individually provided marginal but statistically 
significant increases in the probability of selecting a potent siRNA. Although the 
results were informative, the inventors sougiht to maximize potency and therefore 
consider multiple criteria or parameters. Optimization is particularly important when 
5 developing therapeutics. Interestingly, the probability of selecting a functional siRNA 
based on each thermodynamic criteria was 2% — 4% higher than random, but 4% — 
8% higher for the sequence-related determinates. Presumably, tiiese sequence-related 
increases reflect the complexity of the RNAi mechanism and the multitude of protein- 
RNA interactions that are involved in RNAi-mediated silencing. 
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Table IV 



15 









Improvement 


Criterion 




over Random 


1. 30%— 52% G/C content 


<F50 


16.4% 






>F50 


83.6% 


3.6% 




>F80 


60.4% 


4.3% 




> F95 


23.9 A 


2.2% 


11. At lesst 3 AAJ bases at positions 


<F50 


18.2% 




1 5 — 1 9 of the sense strand 


>F50 


81.8% 


1^8^° 




>F80 


59.7% 


36% 




> F95 


24.0 /o 






<F50 


16.7% 




as measured by Tm of 


> F50 


83.3% 




secondary structure < 20°C 


>F80 


61.1% 


5 OV 




> F95 


24.6 /o 




'^f the senseTs^rand^'*''^" ^ ^ 


< F50 


Qo/ 






> F50 


88 2% 






>F80 


75.0% 


18 9°/ 




> F95 


29.4 /o 


7 7% 


^of the'sense strand'''°" ^ 


<.F50 


y-jiol 














>F80 


62!5% 






>F95 


34.4% 




VI. A U base at position 10 


<F50 


13.9% 


-6.1% 


of the sense strand 


>F50 




6.1% 




>F80 


69!4% 


13.3% 




>F95. 


41.7% 


20% 


VII. A base other than Cat 


<F5a 


18.8% 


-1.2% 


position 19 of the sense strand 


>F50 


81.2% 


1.2% 




>F80 


59.7% 


3.6% 




>F95 


24.2% 


2.5% 


VIII. A base other than Gat 


<F50 


15.2% 


-4.8% 


position 13 of the sense strand 


>F50 


84.8% 


4.8% 




>F80 


61.4% 


5.3% 




>F95 


26.5% 


4.8% 



The siRNA selection algorithm 

In an effort to improve selection further, all identified criteria, including but 
not limited to those listed in Table IV were combined into the algorithms embodied in 
Formula VIII and Formula IX. Each siRNA was then assigned a score (referred to as 
30 a SMARTscore™) according to the values derived from the formulas.- Duplexes that 
scored higher than 0 or 20, for Formulas Vffl and IX, respectively, effectively 
selected a set of functional siRNAs and excluded all non-functional siRNAs. 
Conversely, all duplexes scoring lower than 0 and 20 according to formulas VIII and 
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IX, respectively, contained some functional siKNAs but included all non-functional 
siRNAs. A graphical representation of this selection is shown in Figure 5. 



The methods for obtaining the seven criteria embodied in Table FV are 
5 illustrative of the results of the process used to develop the information for Formulas 
VIII and IX. Thus similar techniques were used to establish the other variables and 
their multipliers. As described above, basic statistical methods were use to determine 
the relative values for these multipliers. 



10 To determine the value for "Improvement over Random" the difference in the 

frequency of a given attribute (e.g. GC content, base preference) at a particular 
position is determined between individual functional groups (e.g. <F50) and the total 
siRNA population studied (e.g. 270 siRNA molecules selected randomly). Thus, for 
instance, in Criterion I (30%-52% GC content) members of the <F50 group were 

1 5 observed to have GC contents between 30-52% in 16.4% of the cases. In contrast, the 
total group of 270 siRNAs had GC contents in this range, 20% of the time. Thus for 
this particular attribute, there is a small negative correlation between 30%-52% GC 
content and this functional group (i.e. 16.4%-20% = -3,6%). Similarly, for Criterion 
VI, (a "U" at position 10 of the sense strand), the >F95 group contained a "U" at this 

20 position 41 .7% of the time. In contrast, the total group of 270 siRNAs had a "U" at 
this position 21 .7% of the time, thus the improvement over random is calculated to be 
20%(or41.7%-21.7%). 

Identifying The Average Internal StabiUtv Profile of Strong siRNA 
25 In order to identify an internal stability profile that is characteristic of strong 

siRNA, 270 different siRNAs derived from the cyclophilin B, the diazepam binding 
inhibitor (DBI), and the luciferase gene were individually transfected into HEK293 
cells and tested for their ability to induce RNAi of the respective gene. Based on their 
performance in the in vivo assay, the sequences were then subdivided into three 
30 groups, (i) >95% silencing; (ii) 80-95% silencing; and (iii) less than 50% silencing. 
Sequences exhibiting 51-84% silencing were eliminated from further consideration to 
reduce the difficulties in identifying relevant thermodynamic patterns. 



wo 2004/045543 PCT/US2003/036787 
42 

Following the division of siRNA into three groups, a statistical analysis was 
performed on each member of each group to determine the average internal stability 
profile (AISP) of the siRNA. To accomplish this the Oligo 5.0 Primer Analysis 
Software and other related statistical packages (e.g. Excel) were exploited to 
5 determine the internal stability of pentamers using the nearest neighbor method 

described by Freier et ai, (1986) Improved free-energy parameters for predictions of 
RNA duplex stability, Proc Natl. Acad. Sci. U. S. A. 83(24): 9373-7. Values for each 
group at each position were then averaged, and the resulting data were graphed on a 
linear coordinate system with the Y-axis expressing the AG (free energy) values in 
10 kcal/mole and the X-axis identifying the position of the base relative to the 5' end. 

The results of the analysis identified multiple key regions in siRNA molecules 
that were critical for successful gene silencing. At the 3 '-most end of the sense strand 
(5'antisense), highly functional siRNA (>95% gene silencing, see Figure 6a, >F95) 

1 5 have a low internal stability (AISP of position 1 9 = ~ -7.6kcal/mol). In contrast low- 
efficiency siRNA (i.e. those exhibiting less than 50% silencing, <F50) display a 
distinctly different profile, having high AG values (~ -8.4kcal/mol) for the same 
position. Moving m a 5' (sense strand) direction, the internal stability of highly 
efficient siRNA rises (position 12 = ~ -8.3kcal/mole) and then drops again (position 7 

20 = ~ -7.7kcal/mol) before leveling off at a value of approximately -8.1keal/mol for the 
5' terminus. SiRNA with poor silencing capabihties show a distinctly different 
profile. While the AISP value at position 12 is nearly identical with that of strong 
siRNAs, the values at positions 7 and 8 rise considerably, peaking at a high of ~ -9.0 
kcal/mol. In addition, at the 5' end of the molecule the AISP profile of strong and 

25 weak siRNA differ dramatically. Unlike the relatively strong values exhibited by 

siRNA in the >95% silencing group, siRNAs that exhibit poor silencing activity have 
weak AISP values (-7.6, -7.5, and -7.5 kcal/mol for positions 1, 2 and 3 respectively). 

Overall the profiles of both strong and weak siRNAs form distinct sinusoidal 
30 shapes that are roughly 1 80° out-of-phase with each other. While these 

thermodynamic descriptions define the archetypal profile of a strong siRNA, it will 
likely be the case that neither the AG values given for key positions in the profile or 
the absolute position of the profile along the Y-axis (i.e. the AG -axis) are absolutes. 
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Profiles that are shifted upward or downward (i.e. having on an average, higher or 
lower values at every position) but retain the relative shape and position of the profile 
along the X-axis can be foreseen as being equally effective as the model profile 
described here. Moreover, it is likely that siKNA that have strong or even stronger 

5 gene-specific silencing effects might have exaggerated AG values (either higher or 
lower) at key positions. Thus, for instance, it is possible that the 5'-most position of 
the sense strand (position 19) could have AG values of 7.4 kcal/mol or lower and still 
be a strong siRNA if, for instance, a G-C -> G-T/U mismatch were substituted at 
position 19 and altered duplex stability. Similarly, position 12 and position 7 could 

10 have values above 8.3 kcal/mol and below 7.7 kcal/mole, respectively, without 

abating the silencing effectiveness of the molecule. Thus, for instance, at position 12, 
a stabilizing chemical modification (e.g. a chemical modification of the 2' position of 
the sugar backbone) could be added that increases the average internal stability at that 
position. Similarly, at position 7, mismatches similar to those described previously 

1 5 could be introduced that would lower the AG values at that position. 

Lastly, it is important to note that while fimctional and non-fimctional siKNA 
were originally defined as those molecules having specific silencing properties, both 
broader or more limiting parameters can be used to define these molecules. As used 

20 herein, unless otherwise specified, "non-fiinctional siRNA" are defined as those 

siRNA that induce less than 50% (<50%) target silencing, "semi-functional siRNA" 
induce 50-79% target silencing, "fimctional siRNA" are molecules that induce 80- 
95% gene silencing, and' "Wghiy-flinctional siRNA" are molecules that induce great 
than 95% gene silencing. These definitions are not intended to be rigid and can vary 

25 depending upon the design and needs of the application. For instance, it is possible 
that a researcher attempting to map a gene to a chromosome using a fimctional assay, 
may identify an siRNA that reduces gene activity by only 30%. While this level of 
gene silencing may be "non-functional" for e.g. therapeutic needs, it is sufficient for 
gene mapping purposes and is, under these uses and conditions, "functional." For 

30 these reasons, functional siRNA can be defined as those molecules having greater 
than 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% silencing capabilities at 
lOOnM transfection conditions. Similarly, depending upon the needs of the study 
and/or application, non-fimctional and semi-fimctional siRNA can be defined as 
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having different parameters. For instance, semi-ftmctional siRNA can be defined as 
being those molecules that induce 20%, 30%, 40%, 50%, 60%, or 70% silencing at 
lOOnM transfection conditions. Similarly, non-functional siRNA can be defined as 
being those molecules that silence gene expression by less than 70%o, 60%, 50%, 
5 40%, 30%, or less. Nonetheless, unless otherwise stated, the descriptions stated in 
the "Definitions" section of this text should be applied. 

Functional attributes can be assigned to each of the key positions in the AISP 
of strong siRNA. The low 5' (sense strand) AISP values of strong siRNAs may be 
1 0 necessary for determining which end of the molecule enters the RISC complex. In 

contrast, the high and low AISP values observed in the central regions of the molecule 
may be critical for siRNA-target noRNA interactions and product release, 
respectively. 

15 If the AISP values described above accurately define the thermodynamic 

parameters of strong siRNA, it would be expected that similar patterns would be 
observed in strong siRNA isolated jfrom nature. Natural siRNAs exist in a harsh, 
RNase-rich enviroimient and it can be hypothesized that only those siRNA that 
exhibit heightened afiSnity for RISC (z.e. siRNA that exhibit an average internal 

20 stability profile similar to those observed in strong siRNA) would survive in an. .. 
intracellular environment. This hypothesis was tested using GFP-specific siRNA 
isolated firom N. benthamiana. Llave et al. (2002) Endogenous aitd Silencing- 
Associated Small RNAs in £la?}ts..3he Plant Cell 14, 1605-1619, introduced long 
double-stranded GFP-encoding RNA into plants and subsequently re-isolated GFP- 

25 specific siRNA from the tissues. The AISP of fifty-nine of these GFP-siRN A were 
determined, averaged, and subsequently plotted alongside the AISP profile obtained 
fi:om the cyclophilin B/DBI/ luciferase siRNA having >90% silencing properties 
(Figure 6b). Comparison of the two groups show that profiles are nearly identical. 
This finding validates the information provided by the internal stability profiles and 

30 demonstrates that: (1) the profile identified by analysis of the cyclophilin B/DBl/ 

luciferase siRNAs are not gene specific; and (2) AISP values can be used to search for 
strong siRNAs in a variety of species. 
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Both chemical modifications and base-pair mismatches can be incorporated 
into siRNA to alter the duplex's AISP and functionality. For instance, introduction of 
mismatches at positions 1 or 2 of the sense strand destabilized the 5 'end of the sense 
strand and increases the functionality of the molecule (see Luc, Figure 7). Similarly, 
5 addition of 2'-0-methyl groups to positions 1 and 2 of the sense strand can also alter 
the AISP and (as a result) increase both the functionality of the molecule and 
eliminate off-target effects that results firom sense strand homology with the unrelated 
targets (Figures 8a, 8b). 

10 Rationale for Criteria in a Biological Context 

The fate of siRNA in the RNAi pathway may be described in 5 major steps: 
(1) duplex recognition and pre-RISC complex formation; (2) ATP-dependent duplex 
unwinding/strand selection and RISC activation; (3) mRNA target identification; (4) 
mRNA cleavage, and (5) product release (Figure 1). Given the level of nucleic acid- 
1 5 protein interactions at each step, siRNA functionality is likely influenced by specific 
biophysical and molecular properties that promote efficient interactions within the 
context of the multi-component complexes. Indeed, the systematic analysis of the 
siRNA test set identified multiple factors that correlate well with functionality. When 
combined into a single algorithm, they proved to be very effective in selecting active 
... 20 siRNAs. 

The fectors described here may also be predictive of key functional 
associations important for each stepJnJRNAi. For example, the potential formation of 
internal hairpin structures correlated negatively with siRNA functionality. 

25 Complementary strands with stable internal repeats are more Ukely to exist as stable 
hairpins thus decreasing the effective concentration of the functional duplex form. 
This suggests that the duplex is the preferred conformation for initial pre-RISC 
association. Indeed, although single complementary strands can induce gene 
silencing, the effective concentration required is at least two orders of magnitude 

30 higher than that of the duplex form. 

siRNA-pre-RISC complex formation is followed by an ATP-dependent duplex 
unwinding step and "activation" of the RISC. The siRNA functionality was shown to 
correlate with overall low internal stabihty of the duplex and low internal stability of 
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the 3' sense end (or differentia] internal stability of the 3' sense compare to the 5' 
sense strand), which may reflect strand selection and entry into the RISC. Overall 
duplex stability and low internal stability at the 3' end of the sense strand were also 
correlated with siRNA functionality. Interestingly, siRNAs with very high and very 
5 low overall stability profiles correlate strongly with non-functional duplexes. One 
interpretation is that high internal stability prevents efficient unwinding while very 
low stability reduces siRNA target affinity and subsequent mRNA cleavage by the 
RISC. 



1 0 Several criteria describe base preferences at specific positions of the sense 

strand and are even more intriguing when considering their potential mechanistic 
roles in target recognition-and mRNA cleavage. Base preferences for A at position 19 
of the sense strand but not C, are particularly interesting because they reflect the same 
base preferences observed for naturally occurring miRNA precursors. That is, among 

15 the reported miRNA precursor sequences 75% contain a U at position 1 which 

corresponds to an A in position 19 of the sense strand of siRNAs, while G was under- 
represented in this same position for miRNA precursors. These observations support 
the hypothesis that both miRNA precursors and siRNA dwplexes are processed by 
very similar if not identical protein machinery. The fimctional interpretation of the 

20. predominance of a U/A base pair is that it promotes flexibility at the 5 ' antisense ends " 
of both siRNA duplexes and miRNA precursors and facilitates efficient imwinding 
and selective strand entrance into an activated RISC. 

Among the criteria associated with base preferences that are likely to 
25 influence mRNA cleavage or possibly product release, the preference for U at 
position 1 0 of the sense strand exhibited the greatest impact, enhancing the 
probabiUty of selecting an F80 sequence by 13.3%. Activated RISC preferentially 
cleaves target mRNA between nucleotides 10 and 1 1 relative to the 5' end of the 
complementary targeting strand. Therefore, it may be that U, the preferred base for 
30 most endoribonucleases, at this position supports more efficient cleavage. 

Alternatively, a U/A bp between the targeting siRNA strand and its cognate target 
mRNA may create an optimal conformation for the RISC-associated "slicing" 
activity. 
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Pooling 

According to another embodiment, the present invention provides a pool of at 
least two siRNAs, preferably in the form of a kit or therapeutic reagent, wherein one 
strand of each of the siRNAs, the sense strand comprises a sequence that is 
5 substantially similar to a sequence within a target mRNA. The opposite strand, the 
antisense strand, will preferably comprise a sequence that is substantially 
complementary to that of the target mRNA. More preferably, one strand of each 
siRNA will comprise a sequence that is identical to a sequence that is contained in the 
target mRNA. Most preferably, each siRNA will be 19 base pairs in length, and one 
1 0 strand of each of the siRNAs will be 100% complementary to a portion of the target 
mRNA. 

By increasing the number of siRNAs directed to a particular target using a 
pool or kit, one is able both to increase the likelihood that at least one siRNA with 

1 5 satisfectory ftinctionality wDl be included, as well as to benefit firom additive or 

synergistic effects. Further, when two or more siRNAs directed against a single gene 
do not have satisfactory levels of ftinctionality alone, if combined, they may 
satisfactorily promote degradation of the target messenger RNA and successfuUy 
inhibit translation. By including multiple siiRNAs in the system, not only is the 

20 . : probability of silencing increased, but the economics of operation are also improved 
when compared to adding differeat siRNAs sequentially. This effect is contrary to the 
conventional wisdom that the concurrent use of multiple siRNA will negatively 
impact gene silencing (e.g. Holen, T. eta7oi(2Q03) "Similar behavior of single strand 
and double strand siRNAs suggests they act through a common RNAi pathway." 

25 NAR 31: 2401-21407). 

In fact, when two siRNAs were pooled together, 54% of the pools of two 
siRNAs induced more than 95% gene silencing. Thus, a 2.5-fold increase in the 
percentage of functionality was achieved by randomly combining two siRNAs. 
30 Further, over 84% of pools containing two siRNAs induced more than 80% gene 
silencing. 

More preferably, the kit is comprised of at least three siRNAs, wherein one 
strand of each siRNA comprises a sequence that is substantially similar to a sequence 
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of the target mRNA and the other strand comprises a sequence that is substantially 
complementary to the region of the target mRNA. As with the kit that comprises at 
least two siRNAs, more preferably one strand will comprise a sequence that is 
identical to a sequence that is contained in the mRNA and another strand that is 100% 
5 complementary to a sequence that is contained in the mRNA. During experiments, 
when three siRNAs were combined together, 60% of the pools induced more than 
95% gene silencing and 92% of the pools induced more than 80% gene silencing. 

Further, even more preferably, the kit is comprised of at least four siRNAs, 
1 0 wherein one strand of each siRNA comprises a sequence that is substantially similar 
to a region of the sequence of the target mRNA, and the other strand comprises a 
sequence that is substantially complementary to the region of the target mRNA. As 
with the kit or pool that comprises at least two siRNAs, more preferably one strand of 
each of the siRNA duplexes will comprise a sequence that is identical to a sequence 
15 that is contained in the mRNA, and another strand that is 100% complementary to a 
sequence that is contained in the mRNA. 



Additionally, kits and pools with at least five, at least six, and at least seven 
. siRNAs may also be useful with the present invention. For example, pools of five 
20 siRNA induced 95% gene silencing with 77% probability and 80% silencing with 
98.8% probability. Thus, pooUng of siRNAs together can result in the creation of a 
target-specific silencing reagent with almost a 99% probability of being functional. 
The fact that such high levels of success are aehievable using such pools of siRNA, 
enables one to dispense with costly and time-consuming target-specific validation 
25 procedures. 



For this embodiment, as well as the other aforementioned embodiments, each 
of the siRNAs within a pool will preferably comprise between 1 8 and 30 base pairs, 
more preferably between 1 8 and 25 base pairs, and most preferably 1 9 base pairs. 
30 Within each siRNA, preferably at least 1 8 contiguous bases of the antisense strand 
will be 100% complementary to the target mRNA. More preferably, at least 19 
contiguous bases of the antisense strand will be 1 00% complementary to the target 
mRNA. Additionally, there may be overhangs on eitihier the sense strand or the 
antisense strand, and these overhangs may be at either the 5' end or the 3' end of 
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either of the strands, for example there may be one or more overhangs of 1 -6 bases. 
When overhangs are present, they are not included in the calculation of the number of 
base pairs. The two nucleotide 3' overhangs mimic natural siRNAs and are 
commonly used but are not essential. Preferably, the overhangs should consist of two 

5 nucleotides, most often dTdT or UU at the 3 ' end of the sense and antisense strand 
that are not complementary to the target sequence. The siRNAs may be produced by 
any method that is now known or that comes to be known for synthesizing double 
stranded RNA that one skilled in the art would appreciate would be useful in the 
present invention. Preferably, the siRNAs will be produced by Dharmacon's 

10 proprietary ACE® technology. However, other methods for synthesizing siRNAs are 
well known to persons skilled in the art and include, but are not limited to, any 
chemical synthesis of RNA oligonucleotides, ligation of shorter oligonucleotides, in 
vitro transcription of RNA oligonucleotides, the use of vectors for expression within 
cells, recombinant Dicer products and PCR products. 

15 

The siRNA duplexes within the aforementioned pools of siRNAs may 
correspond to overlapping sequences within a particular mRNA, or non-overlapping 
sequences of the mRNA. However, preferably they correspond to non-overlapping 
sequences. Further, each siRNA may be selected randomly, or one or more of the 
20 siRNAlhay be selected according to the criteria discussed above for maximizing the 
effectiveness of siRNA. 

Included in the delBnition of siRNAs {uresiR^lAs that contain substituted 
and/or labeled nucleotides that may, for example, be labeled by radioactivity, 

25 fluorescence or mass. The most common substitutions are at the 2' position of the 

ribose sugar, where moieties such as H (hydrogen) F, NH3, OCH3 and other O- alkyl, 
alkenyl, alkynyl, and orthoesters, may be substituted, or in the phosphorous backbone, 
where sulfiir, amines or hydrocarbons may be substituted for the bridging of non- 
bridging atomsjn the phosphodiester bond. Examples of modified siRNAs are 

30 explained more fully in commonly assigned U.S. Patent Application Ser. No. 
10/613,077, filed July 1, 2003, which is incorporated by reference herein. 



Additionally, as noted above, the cell type into which the siRNA is introduced 
may affect the ability of the siRNA to enter the cell; however, it does not appear to 
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affect the ability of the siRNA to function once it enters the cell. Methods for 
introducing double-stranded RNA into various cell t3^es are well known to persons 
skilled in the art. 

5 As persons skilled in the art are aware, in certain species, the presence of 

proteins such as RdRP, the RNA-dependent RNA polymerase, may catalytically 
enhance the activity of the siRNA. For example, RdRP propagates the RNAi effect in 
C. elegans and other non-mammalian organisms. In fact, in organisms that contain 
these protems, the siRNA may be inherited. Two other proteins that are well studied 
1 0 and known to be a part of the machinery are members of the Argonaute family and 
Dicer, as well as their homologues. There is also initial evidence that the RISC 
complex might be associated with the ribosome so the more efficiently translated 
mRNAs will be more susceptible to silencing than others. 

1 5 Another very important factor in the efficacy of siRNA is mRNA localization. 

In general, only cytoplasmic mRNAs are considered to be accessible to RNAi to any 
appreciable degree. However, appropriately designed siRNAs, for example, siRNAs 
modified with intemucleotide linkages, may be able to cause silencing by acting in the 
nucleus. Examples of these types of modifications are described in commonly 

20 assigned U.S. Patent Application Serial Nos. 10/431,027 and 10/613,077, each of 
which is incorporated by reference herein. 

As described above, even when one select&atJeasttwo siRNAs at random, the 
effectiveness of the two may be greater than one would predict based on the 

25 effectiveness of two individual siRNAs. This additive or synergistic effect is 
particularly noticeable as one increases to at least three siRNAs, and even more 
noticeable as one moves to at least four siRNAs. Surprisingly, the pooling of the non- 
functional and semi-functional siRNAs, particularly more than five siRNAs, can lead 
to a silencing mixture that is as effective if not more effective than any one particular 

30 fimctional siRNA. 
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Within the kit of the present invention, preferably each siRNA will be present 
in a concentration of between 0.001 and 200 nM, more preferably between 0.01 and 
200 nM, and most preferably between 0.1 and 10 nM. 

5 In addition to preferably comprising at least four or five siRNAs, the kit of the 

present invention will also preferably comprise a buffer to keep the siRNA duplex 
stable. Persons skilled in the art are aware of buffers suitable for keeping siRNA 
stable. For example, the buffer may be comprised of 1 00 mM KCl, 30 mM HEPES- 
pH 7.5, and 1 mM MgC^. Alternatively, kits might contain complementary strands 
1 0 that contain any one of a number of chemical modifications (e.g. a 2'-0-ACE) that 
protect the agents firom degradation by nucleases. In this instance, the user may (or 
may not) remove the modifying protective grWup (e.g. deprotect) before annealing the 
two complementary strands together. 

1 5 By way of example, the kit may be organized such that pools of siRNA 

duplexes are provided on an array or microarray of wells or drops for a particular 
gene set or for unrelated genes. The array may, for example, be in 96 wells, 384 wells 
or 1284 wells arrayed in a plastic plate or on a glass slide using techniques now 
known or that come to be known to persons skilled in the art. Within an array, 

20 preferably therd^ill be controls such as fimctional anti-lamin A/C, eyclophilin and 
two siRNA duplexes that are not specific to the gene of interest. 

In order to ensure stability of the siRNA poolsipH0i4G»usage, they may be 
retained in lyophilized form at minus twenty degrees (— 20°C) until they are ready for 

25 use. Prior to usage, they should be resuspended; however, even once resuspended, for 
example, in the aforementioned buffer, they should be kept at minus twenty degrees, 
(— 20°C) until used. The aforementioned buffer, prior to use, may be stored at 
approximately 4°C or room temperature. Effective temperatures at which to conduct 
transfections are well known to persons skilled in the art and include for example, 

30 room temperature. 

The kit may be applied either in vivo or in vitro. Preferably, tfie siRNA of the 
pools or kits is applied to a cell through transfection, employing standard transfection 
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protocols. These methods are well known to persons skilled in the art and include the 
use of lipid-based carriers, electroporation, cationic carriers, and microinjection. 
Further, one could apply the present invention by synthesizing equivalent DNA 
sequences (either as two separate, complementary strands, or as hairpin molecules) 
5 instead of siRNA sequences and introducing theni into cells through vectors. Once in 
the cells, the cloned DNA could be transcribed, thereby forcing the cells to generate 
the siRNA. Examples of vectors suitable for use with the present application include 
but are not limited to the standard transient expression vectors, adenoviruses, 
retroviruses, lentivirus-based vectors, as well as other traditional expression vectors. 
1 0 Any vector that has an adequate siRNA expression and procession module may be 
used. Furthermore, certain chemical modifications to siRNAs, including but not 
limited to conjugations to other molecules, may be used to facilitate delivery. For 
certain applications it may be preferable to deliver molecules without transfection by 
simply fonnulating in a physiological acceptable solution. 

15 

This embodiment may be used in connection with any of the aforementioned 
embodiments. Accordingly, the sequences within any pool may be selected by 
rational design. 

20 

Multigene Silencing 

In addition to developing kits that contain multiple siRNA directed against a 
single gene, another embodiment includes the use of multiple^sdRNA targeting 
multiple genes. Multiple genes may be targeted through the use of high- or hyper- 

25 functional siRNA. High- or hyper- functional siRNA that exhibit increased potency, 
require lower concentrations to induce desired phenotypic (and thus therapeutic) 
effects. This circumvents RISC saturation. It therefore reasons that if lower 
concentrations of a single siRNA are needed for knockout or knockdown expression 
of one gene, then the remaining (imcomplexed) RISC will be fi-ee and available to 

30 interact vdth siRNA directed against two, three, four, or more, genes. Thus in this 
embodiment, the authors describe the use of highly functional or hyper-functional 
siRNA to knock out three separate genes. More preferably, such reagents could be 
combined to knockout four distinct genes. Even more preferably, highly functional or 
hyperfunctional siRNA could be used to knock out five distinct genes. Most 
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preferably, siRNA of this type could be used to knockout or knockdown the 
expression of six or more genes. 



Hyperfunctional slRNA 
5 The term hyperfunctional siRNA (hf-siRNA) describes a subset of the siRNA 

population that induces RNAi in cells at low- or sub-nanomolar concentrations for 
extended periods of time. These traits, heightened potency and extended longevity of 
the RNAi phenotype, are highly attractive from a therapeutic standpoint. Agents 
having higher potency require lesser amounts of the molecule to achieve the desired 
1 0 physiological response, thus reducing the probability of side effects due to "off- 
target" interference. In addition to the potential therapeutic benefits associated with 
hyperfunctional siRNA, hf-siRNA are also desirable'from an economic position. 
Hyperfunctional siRNA may cost less on a per-treatment basis, thus reducing the 
overall expenditures to both the manufacturer and the consumer. 

1 5 Identification of hyperfunctional siRNA involves multiple steps that are 

designed to examine an individual siRNA agent's concentration- and/or longevity- 
profiles. In one non-limiting example, a population of siRNA directed against a single 
gene are first analyzed using the previously described algorithm (Formula VIII). 
Individual siRNA are then introduced into a test cell line and assessed for the ability 

20 to degrade the target mRNA. It is important to note that when perfoniiing this step it 
is not necessary to test all of the siRNA. Instead, it is sufficient to test only those 
siRNA having the highest SMARTscores™ (i.e. SMARTscore™ > -10). 
Subsequently, the gene silencing data is plotted against the SMARTscores™ (see 
Figure 9). SiRNA that (1) induce a high degree of gene silencing {i.e. they induce 

25 greater than 80% gene knockdown) and (2) have superior SMARTscores™ (i.e. a 

SMARTscore™ of > -10, suggesting a desirable average internal stability profile) are 
selected for further investigations designed to better understand the molecule's 
potency and longevity. In one, non-limiting study dedicated to understanding a 
molecule's potency, an siRNA is introduced into one (or more) cell types in 

30 increasingly diminishing concentrations (e.g. 3.0 -> 0.3 nM). Subsequently, the level 
of gene silencing induced by each concentration is examined and siRNA that exhibit 
hyperfunctional potency (i.e. those that induce 80% silencing or greater at e.g. 
picomolar concentrations) are identified. In a second study, the longevity profiles of 
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siRNA having high (>-l 0) SMARTscores™ and greater than 80% silencing are 
examined, hi one non-limiting example of how this is achieved, siRNA are introduced 
into a test cell line and the levels of KNAi are measured over an extended period of 
time (e.g. 24-168 hrs). SiR^JAs that exhibit strong RNA interference patterns (i.e. 
5 >80 % interference) for periods of time greater than, e.g., 1 20 hours are thus 

identified. Studies similar to those described above can be performed on any and all 
of the > 10* siRNA included in this document to ftirther define the most ftinctional 
molecule for any given gene. Molecules possessing one or both properties (extended 
longevity and heightened potency) are labeled "hyperfbnctional siRNA," and 
1 0 earmarked as candidates for future therapeutic studies. 

While the example(s) given above descaibe one means by which 
hyperfimctional siRNA can be isolated, neither the assays themselves nor the 
selection parameters used are rigid and can vary with each family of siRNA. Families 
1 5 of siRNA include siRNAs directed against a single gene, or directed against a related 
fanoily of genes. 

The highest quality siRNA achievable for any given gene may vary 
considerably. Thus, for example, in the case of one gene (gene X), rigorous studies 

20 such as those described above may enable the identification of an siRNA that, at 
picomolar concentrations, induces 99^% silencing for a period of 10 days. Yet 
identical studies of a second gene (gene Y) may yield an siRNA that at high 
nanomolar concentrations (e.g. lOOnM) induces only 75% silencing fox^period of 2 
days. Both molecules represent the very optimum siRNA for their respective gene 

25 targets and therefore are designated "hyperfunctional." Yet due to a variety of factors 
including but not limited to target concentration, slKNA stability, cell type, off-target 
interference, and others, equivalent levels of potency and longevity are not 
achievable. Thus, for these reasons, the parameters described in the before mentioned 
assays, can vary. While the initial screen selected siRNA that had SMARTscores™ 

30 above -1 0 and a gene silencing capabiUty of greater than 80%, selections that have 
stronger (or weaker) parameters can be implemented. Similarly, in the subsequent 
studies designed to identify molecules with high potency and longevity, the desired 
cutoff criteria (i.e. the lowest concentration that induces a desirable level of 
interference, or the longest period of time that interference can be observed) can vary. 
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The experimentation subsequent to application of the rational criteria of this 
application is significantly reduced where one is trying to obtain a suitable 
hyperfunctional siRNA for, for example, therapeutic use. When, for example, the 
additional experimentation of the type described herein is applied by one skiUed in the 
art with this disclosure in hand, a hyperfunctional siRNA is readily identified. 

The siRNA may be introduced into a cell by any method that is now known or 
that comes to be known and that firom reading this disclosure, persons skiUed in the 
art would determine would be useful in connection with the present invention in 
enabling siRNA to cross the cellular membrane. These methods include, but are not 
limited to, any manner of transfection, such as for example transfection employing 
DEAE-Dextran, calcium phosphate, cationic lipids/liposomes; micelles, manipulation 
of pressure, microinjection, electroporation, immunoporation, use of vectora such as 
viruses, plasmids, cosmids, bacteriophages, cell fiisions, and coupling of the 
polynucleotides to specific conjugates or ligands such as antibodies, antigens, or 
receptors, passive introduction, adding moieties to the siKNA that facilitate its uptake, 
and the like. 

Havmg described the invention with a degree of particularity, examples will 
now be provided. These examples are not intended to and should not be construed to 
limit the scope of the claims in any way. 

Examples .s^^: -ui...... 

General Techniques and Nomenclatures 

siRNA nomenclature. All siRNA duplexes are referred to by sense strand. The first 
nucleotide of the 5'-end of the sense strand is position 1, which corresponds to 
position 19 of the antisense strand for a 19-mer. hi most cases, to compare results 
firom different experiments, silencing was determined by measuring specific transcript 
mRNA levels or enzymatic activity associated with specific transcript levels, 24 hours 
post-transfection, with siRNA concentrations held constant at 100 nM. For all 
experiments, unelss otherwise specified transfection efficiency was ensured to be over 
95%, and no detectable cellular toxicity was observed. The following system of 
nomenclature was used to compare and report siRNA-silendng fiinctionaUty: "F" 
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followed by the degree of minimal knockdown. For example, F50 signifies at least 
50% knockdown, F80 means at least 80%, and so forth. For this study, all sub-F50 
siRNAs were considered non-functional. 

5 Cell culture and transfection. 96-well plates are coated with 50 |j,l of 50 mg/ml poly- 
L-lysine (Sigma) for 1 hr, and then washed 3X with distilled water before being dried 
for 20 min. HEK293 cells or HEK293Lucs or any other cell type of interest are 
released from their solid support by trypsinization, diluted to 3.5 X 10^ cells/ml, 
followed by the addition of 100 fJ.L of cells/well. Plates are then incubated overnight 

10 at 37° C, 5% CO2. Transfection procedures can vary widely depending on the cell 
type and transfection reagents. In one non-limitmg example, a transfection mixture 
consisting of 2 mL Opti-MEM I (Gibco-BRL), 80 ^il Lipofectanime 2000 
(Invitrogen), 15 jiL SUPERNasin at 20 U/(il (Ambion), and 1.5 |il of reporter gene 
plasmid at 1 \Lg/ ]xl is prepared in 5-ml polystyrene round bottom tubes. 100 ^1 of 

1 5 transfection reagent is then combined with 100 n\ of siRNAs in polystyrene deep-well 
titer plates (Beckman) and incubated for 20 to 30 min at room temp. 550 |al of Opti- 
MEM is then added to each well to bring the final siRNA concentration to 100 nM. 
Plates are then sealed with parafilm and mixed. Media is removed from HEK293 
cells and replaced with 95 pj of transfection mixture. Cells are incubated overnight at 

20 37°C,5%C02. 

Quantification of gene knockdown. A variety of quantification procedures can be 
used to measure the level of silencing induced by siRNA or siRNA pools. In one non- 
limiting example: to measure mRNA levels 24 hrs post-transfection, QuantiGene 
25 branched-DNA (bDNA) kits (Bayer) (Wang, et al. Regulation of insulin preRNA 
splicing by glucose. Proc Natl Acad Sci 1997, 94:4360.) are used according to 
manufacturer instructions. To measure luciferase activity, media is removed from 
HEK293 cells 24 hrs post-transfection, and 50 [d of Steady-GLO reagent (Promega) 
is added. After 5 min, plates are analyzed on a plate reader. 



Example I. Sequences Used to Develop the Algorithm. 

Anti-Firefly and anti-Cyclophilin siRNAs panels (Figure 5a, b) sorted 
according to using Formula VIII predicted values. All siRNAs scoring more than 0 
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(formula VUl) and more then 20 (formula IX) are folly fonctional. All ninety 
sequences for each gene (and DBI) appear below in Table III. 



TABLE m 



Cyclo 


1 


SEQ. ID 0032 


GUUCCAAAAACAGUGGAUA 


Cyclo 


2 


SEQ. ID 0033 


UCCAAAAACAGUGGAUAAU 


c^il 


3 


SEQ. ID 0034 






4 


SEQ. ID 0035 


AAAACAGUGGAUAAUUUUG 


Cyclo 




SEQ. ID 0036 


AACAGUGGAUAAUUUUGUG 


Cyclo 


5 


SEQ. ID 0037 


CAGUGGAUAAUUUUGUGGC 


C^cll 




SEQ ID 0038 


GUGGAUAAUUUUGUGGCCU 


C^clo 






GGAUAAUUUUGUGGCCUUA 


c!cll 






AUAAUUUUGUG GCCUUAGC 


Cvcll 


10 


OCV^. JUL/ VVt 1 


AAUUUUGUGGCCUUAGCUA 








UUUUGUGGCCUUAGCUACA 


° 




SEQ. ID 0043 


UUGUGGCCUUAGCUACAGG 






OCv^. IXJ UU*W 


GUGGCCUUAGCUACAGGAG 


/V 




SEQ. ID 0045 


GGCCUUAGCUACAGGAGAG 


c^il 






CCUUAGCUACAGGAGAGAA 


C^lo 


16 


dCV^. IIJ uu*f / 


UUAGCUACAGGAGAGAAAG 


C^Ilo 


17 


SEQ ID 0048 


AGCUACAGGAGAGAAAGGA 




18 


Qpri TFi AA^O 


CUACAGGAGAGAAAGGAUU 


Cyclo 


19 


SEQ ID 0050 


ACAGGAGAGAAAGGAUUUG 


Cycll 


20 


SEQ ID 0051 


AGGAGAGAAAGGAUUUGGC 




21 


SEQ ID 0052 


GAGAGAAAGGAUUUGGCUA 


Cyclo 


22 


SEQ ID 0053 


GAGAAAGGAUUUGGCUACA 




23 


SEQ ID 0054 


GAAAGGAUUUGGCUACAAA 


Cycll 


24 


SEQ ID 0055 


AAGGAUUUGGCUACAAAAA 


Cyclo 


25 


SEQ. ID 0056 


GGAUUUGGCUACAAAAACA 


Cyclo 


26 


SEQ. ID 0057 


AUUUGGCUACAAAAACAGC 


Cyclo 


27 


SEQ. ID 0058 


UUGGCUACAAAAACAGCAA 


Cyclo 


28 


SEQ. ID 0059 


GGCUACAAAAACAGCAAAU 


Cyclo 


29 


SEQ. ID 0060 


CUACAAAAACAGCAAAUUC 


Cyclo 


30 


SEQ. ID 0061 ■ 


ACAAAAACAGCAAAUUCCA 


Cyclo 


31 


SEQ. ID 0062 


AAAAACAGCAAAUUCCAUC 


Cyclo 


32 


SEQ. ID 0063 


AAACAGCAAAUUCCAUCGU 


Cyclo 


33 


SEQ. ID 0064 


ACAGCAAAUUCCAUCGUGU 


Cyclo 


34 


SEQ. ID 0065 


AGCAAAUUCCAUCGUGUAA 
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Cyclo 


35 


SEQ. ID 0066 


Cyclo 


36 


SEQ. ID 0067 


Cyclo 


37 


SEQ. ID 0068 


Cyclo 


38 


SEQ. ID 0069 


Cyclo 


39 


SEQ. ID 0070 


Cyclo 


40 


SEQ. ID 0071 . 


Cyclo 


41 


SEQ. ID 0072 


Cyclo 


42 


SEQ. ID 0073 


Cyclo 


43 


SEQ. ID 0074 


Cyclo 


44 


SEQ. ID 0075 


Cyclo 


45 


SEQ. ID 0076 


Cyclo 


46 


SEQ. ID 0077 


Cyclo 


47 


SEQ. ID 0078 


Cyclo 


48 


SEQ. ID 0079 


Cyclo 


49 


SEQ. ID 0080 


Cyclo 


50 


SEQ. ID 0081 


Cyclo 


51 


SEQ. ID 0082 


Cyclo 


52 


SEQ. ID 0083 


Cyclo 


53 


SEQ. ID 0084 


Cyclo 


54 


SEQ. ID 0085 


Cyclo 


55 


SEQ. ID 0086 


Cyclo 


56 


SEQ. ID 0087 


Cyclo 


57 


SEQ. ID 0088 


Cyclo 


58 


SEQ. ID 0089 


Cyclo 


59 


SEQ. ED 0090 


Cyclo 


60 


SEQ. ID 0091 


Cyclo 


61 


SEQ. ID 0092 


Cyclo 


62 


SEQ. ID 0093 


Cyclo 


63 


SEQ. ID 0094 


Cyclo 


64 


SEQ. ID 0095 


Cyclo 


65 


SEQ. ID 0094 


Cyclo 


66 


SEQ. ID 0096 


Cyclo 


67 


SEQ. ID 0097 


Cyclo 


68 


SEQ. ID 0098 


Cyclo 


69 


SEQ. ID 0099 


Cyclo 


70 


SEQ. ID 0100 


Cyclo 


71 


SEQ. ID 0101 


Cyclo 


72 


SEQ. ID 0102 


Cyclo 


73 


SEQ. ID 0103 



CAAAUUCCAUCGUGUAAUC 
AAUUCCAUCGUGUAAUCAA 
UUCCAUCGUGUAAUCAAGG 
CCAUCGUGUAAUCAAGGAC 
AUCGUGUAAUCAAGGACUU 
CGUGUAAUCAAGGACUUCA 
UGUAAUCAAGGACUUCAUG 
UAAUCAAGGACUUCAUGAU 
AUCAAGGACUUCAUGAUCC 
CAAGGACUUCAUGAUCCAG 
AGGACUUCAUGAUCCAGGG 
GACUUCAUGAUCCAGGGCG 
CUUCAUGAUCCAGGGCGGA 
UCAUGAUCCAGGGCGGAGA 
AUGAUCCAGGGCGGAGACU 
GAUCCAGGGCGGAGACUUC 
UCCAGGGCGGAGACUUCAC 
CAGGGCGGAGACUUCACCA 
GGGCGGAGACUUCACCAGG 

GCGGAGACUUCACCAGGGG 
GGAGACUUCACCAGGGGAG 
AGACUUCACCAGGGGAGAU 
ACUUCACCAGGGGAGAUGG 
UUCACCAGGGGAGAUGGCA 
CACCAGGGGAGAUGGCACA 
CCAGGGGAGAUGGCACAGG 
AGGGGAGAUGGCACAGGAG 
GGGAGAUGGCACAGGAGGA 
GAGAUGGCACAGGAGGAAA 
GAUGGCACAGGAGGAAAGA 
UGGCACAGGAGGAAAGAGC 
GCACAGGAGGAAAGAGCAU 
ACAGGAGGAAAGAGCAUCU 
AGGAGGAAAGAGCAUCUAC 
GAGG AAAG AGC AUCUACGG 
GGAAAGAGCAUCUACGGUG 
AAAGAGCAUCUACGGUGAG 
AGAGCAUCUACGGUGAGCG 
AGCAUCUACGGUGAGCGCU 
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Cyclo 


74 


SEQ. ID 0104 


Cyclo 


75 


SEQ. ID 0105 


Cyclo 


76 


SEQ. ID 0106 


Cyclo 


77 


SEQ. ID 0107 


Cyclo 


78 


SEQ. ID 0108 


Cyclo 


79 


SEQ. ID 0109 


Cyclo 


80 


SEQ. ID 0110 


Cyclo 


81 


SEQ. ID 01 11 


Cyclo 


82 


SEQ. ID 01 12 


Cyclo 


83 


SEQ. ID 01 13 


Cyclo 


84 


SEQ. ID 0114 


Cyclo 


85 


SEQ. ID 01 15 


Cyclo 


86 


SEQ. ID 0116 


Cyclo 


87 


SEQ. ID 01 17 


Cyclo 


88 


SEQ. ID 01 18 


Cyclo 


89 


SEQ. ID 01 19 


Cyclo 


90 


SEQ. ID 0120 


DB 


1 


SEQ. ID 0121 


DB 


2 


SEQ. ID 0122 


DB 


3 


SEQ. ID 0123 


DB 


4 


SEQ. ID 0124 


DB 


5 


SEQ. ID 0125 


DB 


6 


SEQ. ID 0126 


DB 


7 


SEQ. ID 0127 


DB 


8 


SEQ. ID 0128 


DB 


9 . 


SEQ. ID 0129 


DB 


10 


SEQ. ID 0130 


DB 


11 


SEQ. ID 0131 


DB 


12 


SEQ. ID 0132 


DB 


13 


SEQ. ID 0133 


DB 


14 


SEQ. ID 0134 


DB 


15 


SEQ. ID 0135 


DB 


16 


SEQ. ID 0136 


DB 


17 


SEQ. ID 0137 


DB 


18 


SEQ. ID 0138 


DB 


19 


SEQ. ID 0139 


DB 


20 


SEQ. ID 0140 


DB 


21 


SEQ. ID 0141 


DB 


22 


SEQ. ID 0142 


DB 


23 


SEQ. ID 0143 



CAUCUACGGUGAGCGCUUC 
UCUACGGUGAGCGCUUCCC 
UACGGUGAGCGCUUCCCCG 
CGGUGAGCGCUUCCCCGAU 
GUGAGCGCUUCCCCGAUGA 
GAGCGCUUCCCCGAUGAGA 
GCGCUUCCCCGAUGAGAAC 
GCUUCCCCGAUGAGAACUU 
UUCCCCGAUGAGAACUUCA 
CCCCGAUGAGAACUUCAAA 
CCGAUGAGAACUUCAAACU 
GAUGAGAACUUCAAACUGA 
UGAGAACUUCAAACUGAAG 
AGAACUUCAAACUGAAGCA 
AACUUCAAACUGAAGCACU 
CUUCAAACUGAAGCACUAC 
UCAAACUGAAGCACUACGG 
ACGGGCAAGGCCAAGUGGG 
CGGGCAAGGCCAAGUGGGA 
GGGCAAGGCCAAGUGGGAU 
GGCAAGGCCAAGUGGGAUG 
GCAAGGCCAAGUGGGAUGC 
CAAGGCCAAGUGGGAUGCC 
AAGGCCAAGUGGGAUGCCU 
AGGCCAAGUGGGAUGCCUG 
GGCCAAGUGGGAUGCCUGG 
GCCAAGUGGGAUGCCUGGA 
CCAAGUGGGAUGCCUGGAA 
CAAGUGGGAUGCCUGGAAU 
AAGUGGGAUGCCUGGAAUG 
AGUGGGAUGCCUGGAAUGA 
GUGGGAUGCCUGGAAUGAG 
UGGGAUGCCUGGAAUGAGC 
GGGAUGCCUGGAAUGAGCU 
GGAUGCCUGGAAUGAGCUG 
GAUGCCUGGAAUGAGCUGA 
AUGCCUGGAAUGAGCUGAA 
UGCCUGGAAUGAGCUGAAA 
GCCUGGAAUGAGCUGAAAG 
CCUGGAAUGAGCUGAAAGG 
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DB 


24 


SEQ. ID 0144 


DB 


25 


SEQ. ID 0145 


DB 


26 


SEQ. ID 0146 


DB 


27 


SEQ. ID 0147 


DB 


28 


SEQ. ID 0148 


DB 


29 


SEQ. ID 0149 


DB 


30 


SEQ. ID 0150 


DB 


31 


SEQ. ID 0151 


DB 


32 


SEQ. ID 0152 


DB 


33 


SEQ. ID 0153 


DB 


34 


SEQ. ED 0154 


DB 


35 


SEQ. ID 0155 


DB 


36 


SEQ. ID 0156 


DB 


37 


SEQ. ID 0157 


DB 


38 


SEQ. ID 0158 


DB 


39 


SEQ. ID 0159 


DB 


40 


SEQ. ID 0160 


DB 


.41 


SEQ. ID 0161 


DB 


42 


SEQ. ID 0162 


DB 


43 


SEQ. ID 0163 


DB 


44 


SEQ. ED 0164 


DB 


45 


SEQ. ID 0165 


DB 


46 


SEQ. ID 0166 


DB 
DB 


47 
48 


SEQ. ID 0167 
SEQ. ID 0168 


DB 


49 


SEQ. ID 0169 


DB 


50 


SEQ. ID 0170 


DB 


51 


SEQ. ID 0171 


DB 


52 


SEQ. ID 0172 


DB 


53 


SEQ. ID 0173 


DB 


54 


SEQ. ID 0174 


DB 


55 


SEQ. ID 0175 


DB 


56 


SEQ. ID 0176 


DB 


57 


SEQ. ID 0177 


DB 


58 


SEQ. ED 0178 


DB 


59 


SEQ. ID 0179 


DB 


60 


SEQ. ID 0180 


DB 


61 


SEQ. ID 0181 


DB 


62 


SEQ. ID 0182 


DB 


63 


SEQ. ID 0183 



CUGGAAUGAGCUGAAAGGG 
UGGAAUGAGCUGAAAGGGA 
GGAAUGAGCUGAAAGGGAC 
GAAUGAGCUGAAAGGGACU 
AAUGAGCUGAAAGGGACUU 
AUGAGCUGAAAGGGACUUC 
UGAGCUGAAAGGGACUUCC 
GAGCUGAAAGGGACUUCCA 
AGCUGAAAGGGACUUCCAA 
GCUGAAAGGGACUUCCAAG 
CUGAAAGGGACUUCCAAGG 
UGAAAGGGACUUCCAAGGA 
GAAAGGGACUUCCAAGGAA 
AAAGGGACUUCCAAGGAAG 
AAGGGACUUCCAAGGAAGA 
AGGGACUUCCAAGGAAGAU 
GGGACUUCCAAGGAAGAUG 
GGACUUCCAAGGAAGAUGC 
GACUUCCAAGGAAGAUGCC 
ACUUCCAAGGAAGAUGCCA 
CUUCCAAGGAAGAUGCCAU 
UUCCAAGGAAGAUGCCAUG 
UCCAAGGAAGAUGCCAUGA 
CGAAGGAAGAUGCCAUGAA 
CAAGGAAGAUGCCAUGAAA 
AAGGAAGAUGCCAUGAAAG 
AGGAAGAUGCCAUGAAAGC 
GGAAGAUGCCAUGAAAGCU 
GAAGAUGCCAUGAAAGCUU 
AAGAUGCCAUGAAAGCUUA 
AGAUGCCAUGAAAGCUUAC 
GAUGCCAUGAAAGCUUACA 
AUGCCAUGAAAGCUUACAU 
UGCCAUGAAAGCUUACAUC 
GCCAUGAAAGCUUACAUCA 
CCAUG AAAG CUUAC AUCAA 
CAUGAAAGCUUACAUCAAC 
AUGAAAGCUUACAUCAACA 
UGAAAGCUUACAUCAACAA 
GAAAGCUUACAUCAACAAA 
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DB 


64 


SEQ.ID0184 


DB 


65 


SEQ.1D0185 


DB 


65 


SEQ.ID0I86 


DB 


67 


SEQ. ID 0187 


DB 


68 


SEQ.ID0188 


DB 


69 


SEQ. ID 0189 


DB 


70 


SEQ. ID 0190 


DB 


71 


SEQ. ID 0191 


DB 


72 


SEQ. ID 0192 


DB 


73 


SEQ. ID 0193 


DB 


74 


SEQ. ID 0194 


DB 


75 


SEQ. ID 0195 


DB 


76 


SEQ. ©0196 


DB 


77 


SEQ. ID 0197 


DB 


78 


SEQ. ID 0198 


DB 


79 


SEQ. ID 0199 


DB 


80 


SEQ. ID 0200 


DB 


81 


SEQ. ID 0201 


DB 


82 


SEQ. ID 0202 


DB 


83 


SEQ. ID 0203 


DB 


84 


SEQ. ID 0204 


DB 


85 


SEQ. ID 0205 


DB 


86 


SEQ. ID 0206 


DB 


87 


SEQ. ID 0207 


DB 


88 


SEQ. ID 0208 


DB 


89 


SEQ. ID 0209 


DB 


90 


SEQ. ID 0210 


Luc ■ 


1 


SEQ. ID 0211 


Luc 


2 


SEQ. ID 0212 


Luc 


3 


SEQ. ID 0213 


Luc 


4 


SEQ. ED 0214 


Luc 


5 


SEQ. ID 0215 


Luc 


6 


SEQ. ID 0216 


Luc 


7 


SEQ. ID 0217 


Luc 


8 


SEQ. ID 0218 


Luc 


9 


SEQ. ID 0219 


Luc 


10 


SEQ. ID 0220 


Luc 


11 


SEQ. ID 0221 


Luc 


12 


SEQ. ID 0222 


Luc 


13 


SEQ. ID 0223 



AAAGCUUACAUCAACAAAG 
AAGCUUACAUCAACAAAGU 
AGCUUACAUCAACAAAGUA 
GCUUACAUCAACAAAGUAG 
CUUACAUCAACAAAGUAGA 
UUACAUCAACAAAGUAGAA 
UACAUCAACAAAGUAGAAG 
ACAUCAACAAAGUAGAAGA 
CAUCAACAAAGUAGAAGAG 
AUCAACAAAGUAGAAGAGC 
UCAACAAAGUAGAAGAGCU 
CAACAAAGUAGAAGAGCUA 
AACAAAGUAGAAGAGCUAA 
ACAAAGUAGAAGAGCUAAA 
CAAAGUAGAAGAGCUAAAG 
AAAGUAGAAGAGCUAAAGA 
AAGUAGAAGAGCUAAAGAA 
AGUAGAAGAGCUAAAGAAA 
GUAGAAGAGCUAAAGAAAA 
UAGAAGAGCUAAAGAAAAA 
AGAAGAGCUAAAGAAAAAA 
GAAGAGCUAAAGAAAAAAU 
AAGAGCUAAAGAAAAAAUA 
AGAGCUAAAGAAAAAAUAC 
GAGCUAAAGAAAAAAUACG 
AGCUAAAGAAAAAAUACGG 
GCUAAAGAAAAAAUACGGG 
AUCCUCAUAAAGGCCAAGA 
AGAUCCUCAUAAAGGCCAA 
AGAGAUCCUCAUAAAGGCC 
AGAGAGAUCCUCAUAAAGG 
UCAGAGAGAUCCUCAUAAA 
AAUCAGAGAGAUCCUCAUA 
AAAAUCAGAGAGAUCCUCA 
GAAAAAUCAGAGAGAUCCU 
AAGAAAAAUCAGAGAGAUC 
GCAAGAAAAAUCAGAGAGA 
ACGCAAGAAAAAUCAGAGA 
CGACGCAAGAAAAAUCAGA 
CUCGACGCAAGAAAAAUCA 
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Luc 


14 


SEQ. ED 0224 


Luc 


15 


SEQ. ID 0225 


Luc 


16 


SEQ. ID 0226 


Luc 


17 


SEQ. ID 0227 


Luc 


18 


SEQ. ID 0228 


Luc 


19 


SEQ. ID 0229 


Luc 


20 


SEQ. ID 0230 


Luc 


21 


SEQ. ID 0231 


Luc 


22 


SEQ. ID 0232 


Luc 


23 


SEQ. ID 0233 


Luc 


24 


SEQ. ID 0234 


Luc 


25 


SEQ. ID 0235 


Luc 


26 


SEQ. ID 0236 


Luc 


27 


SEQ. ID 0237 


Luc 


28 


SEQ. ID 0238 


Luc 


29 


SEQ. ID 0239 


Luc 


30 


SEQ. ID 0240 


Luc 


31 


SEQ. ID 0241 


Luc 


32 


SEQ. ID 0242 


Luc 


33 


SEQ. ID 0243 


Luc 


34 


SEQ. ID 0244 


Luc 


35 


SEQ. ID 0245 


Luc 


36 


SEQ. ID 0246 


Luc 


37 


SEQ. ID 0247 


Luc 


38 


SEQ. ID 0248 


Luc 


39 


SEQ. ID 0249 


Luc 


40 


SEQ. ID 0250 


Luc 


41 


SEQ. ID 0251 


Luc 


42 


SEQ. ID 0252 


Luc 


43 


SEQ. ID 0253 


Luc 


44 


SEQ. ID 0254 


Luc 


45 


SEQ. ID 0255 


Luc 


46 


SEQ. ID 0256 


Luc 


47 


SEQ. ID 0257 


Luc 


48 


SEQ. ID 0258 


Luc 


49 


SEQ. ID 0259 




50 


SEQ. ID 0260 


Luc 


51 


SEQ. ID 0261 


Luc 


52 


SEQ. ID 0262 


Luc 


53 


SEQ. ID 0263 



AACUCGACGCAAGAAAAAU 
AAAACUCGACGCAAGAAAA 
GGAAAACUCGACGCAAGAA 
CCGGAAAACUCGACGCAAG 
UACCGGAAAACUCGACGCA 
CUUACCGGAAAACUCGACG 
GUCUUACCGGAAAACUCGA 
AGGUCUUACCGGAAAACUC 
AAAGGUCUUACCGGAAAAC 
CGAAAGGUCUUACCGGAAA 
ACCGAAAGGUCUUACCGGA 
GUACCGAAAGGUCUUACCG 
AAGUACCGAAAGGUCUUAC 
CGAAGUACCGAAAGGUCUU 
GACGAAGUACCGAAAGGUC 
UGGACGAAGUACCGAAAGG 
UGUGGACGAAGUACCGAAA 
UUUGUGGACGAAGUACCGA 
UGUUUGUGGACGAAGUACC 
UGUGUUUGUGGACGAAGUA 
GUUGUGUUUGUGGACGAAG 
GAGUUGUGUUUGUGGACGA 
AGGAGUUGUGUUUGUGGAC 
GGAGGAGUUGUGUUUGUGG 
GCGGAGGAGUUGUGUUUGU 
GCGCGGAGGAGUUGUGUUU 
UUGCGCGGAGGAGUUGUGU 
AGUUGCGCGGAGGAGUUGU 
AAAGUUGCGCGGAGGAGUU 
AAAAAGUUGCGCGGAGGAG 
CGAAAAAGUUGCGCGGAGG 
CGCGAAAAAGUUGCGCGGA 
ACCGCGAAAAAGUUGCGCG 
CAACCGCGAAAAAGUUGCG 
AACAACCGCGAAAAAGUUG 
GUAACAACCGCGAAAAAGU 
AAGUAACAACCGCGAAAAA 
UCAAGUAACAACCGCGAAA 
AGUCAAGUAACAACCGCGA 
CCAGUCAAGUAACAACCGC 
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Luc 


54 


SEQ.ID0264 


Luc 


55 


SEQ.ID0265 


Luc 


56 


SEQ.ID0266 


Luc 


57 


SEQ. ID 0267 


Luc 


58 


SEQ. ID 0268 


Luc 


59 


SEQ. ID 0269 


Luc 


60 


SEQ. ID 0270 


Luc 


61 


SEQ. ED 0271 


Luc 


62 


SEQ. ID 0272 


Luc 


63 


SEQ. ID 0273 


Luc 


64 


SEQ. ID 0274 


Luc 


65 


SEQ. ID 0275 


Luc 


66 


SEQ. ID 0276 


Luc 


67 


SEQ. ID 0277 


Luc 


68 


SEQ. ID 0278 


Luc 


69 


SEQ. ID 0279 


Luc 


70 


SEQ. ID 0280 


Luc 


71 


SEQ. ID 0281 


Luc 


72 


SEQ. ID 0282 


Luc 


73 


SEQ. ID 0283 


Luc 


74 


SEQ. ID 0284 


Luc 


75 


SEQ. ID 0285 


Luc 


76 


SEQ. ID 0286 


Luc 


77 


SEQ. ID 0287 


Luc ' 


78 


SEQ. ID 0288 


Luc 


79 


SEQ. ID 0289 


Luc 


80 


SEQ. ID 0290 


Luc 


81 


SEQ. ID 0291 


Luc 


82 


SEQ. ID 0292 


Luc 


83 


SEQ. ID 0293 


Luc 


84 


SEQ. ID 0294 


Luc 


85 


SEQ. ID 0295 


Luc 


86 


SEQ. ID 0296 




87 


SEQ. ID 0297 




88 


SEQ. ID 0298 


Luc 


89 


SEQ. ID 0299 


Luc 


90 


SEQ. ID 0300 



CGCCAGUCAAGUAACAACC 
GUCGCCAGUCAAGUAACAA 
ACGUCGCCAGUCAAGUAAC 
UUACGUCGCCAGUCAAGUA 
GAUUACGUCGCCAGUCAAG 
UGGAUUACGUCGCCAGUCA 
CGUGGAUUACGUCGCCAGU 
AUCGUGGAUUACGUCGCCA 
AGAUCGUGGAUUACGUCGC 
AGAGAUCGUGGAUUACGUC 
AAAGAGAUCGUGGAUUACG 
AAAAAGAGAUCGUGGAUUA 
GGAAAAAGAGAUCGUGGAU 
ACGGAAAAAGAGAUCGUGG 
UGACGGAAAAAGAGAUCGU 
GAUGACGGAAAAAGAGAUC 
ACGAUGACGGAAAAAGAGA 
AGACGAUGACGGAAAAAGA 
AAAGACGAUGACGGAAAAA 
GGAAAGACGAUGACGGAAA 
ACGGAAAGACGAUGACGGA 
GCACGGAAAGACGAUGACG 
GAGCACGGAAAGACGAUGA 
UGGAGCACGGAAAGACGAU 
UUUGGAGCACGGAAAGACG 
GUUUUGGAGCACGGAAAGA 
UUGUUUUGGAGCACGGAAA 
UGUUGUUUUGGAGCACGGA 
GUUGUUGUUUUGGAGCACG 
CCGUUGUUGUUUUGGAGCA 
CGCCGUUGUUGUUUUGGAG 
GCCGCCGUUGUUGUUUUGG 
CCGCCGCCGUUGUUGUUUU 
UCCCGCCGCCGUUGUUGUU 
CUUCCCGCCGCCGUUGUUG 
AACUUCCCGCCGCCGUUGU 
UGAACUUCCCGCCGCCGUU 



wo 2004/045543 PCT/US2003/036787 
64 

Example II. VaUdation of the Algorithm using DBI, Lnciferase, PLK, EGFR, 
and SEAP 

The algorithm (Formula VIII) identified siRNAs for five genes, human DBI, 
firefly luciferase (£Luc), renilla luciferase (rLuc), human PLK, and human secreted 
5 alkaline phosphatase (SEAP). Four individual siRNAs were selected on the basis of 
their SMARTscores™ derived by analysis of their sequence using Formula VIII (all 
of the siRNAs would be selected with Formula IX as well) and analyzed for their 
ability to silence their targets' expression, hi addition to the scoring, a BLAST search 
was conducted for each siRNA. To minimize the potential for off-target silencing 

1 0 effects, only those target sequences with more than three mismatches against un- 
related sequences were selected. Semizarov, etal. Specificity of short interfering RNA 
determined through gene expression signatures. Proc. Natl. Acad. Sci. U.S.A. 2003, 
100:6347. These duplexes were analyzed individually and in pools of 4 and 
compared with several siRNAs that were randomly selected. The functionality was 

1 5 measured a percentage of targeted gene knockdown as compared to controls. All 

SiRNAs were transfected as described by the methods above at 100 nM concentration 
into HEK293 using lipofectamine 2000. The level of the targeted gene expression 
was evaluated by B-DNA as described above and normalized to the non-specific 
control. Figure 10 shows that the siRNAs selected by the algorithm disclosed herein 

20 were significantly more potent than randomly selected siRNAs. Uie algorithm 
increased the chances of identifying an F50 siRNA fi-om 48% to 91%, and an F80 
SiRNA from 13% to 57%. hi addition, pools of SMART siRNA silence the selected 
target better than randomly selected pools (see Figure lOF). 

25 Example HI. VaUdation of the Algorithm Using Genes Involved in Clathrin- 
Dependent Endocytosis. 

Components of clathrin-mediated endocytosis pafliway are key to modulating 
intracellular signaling and play important roles in disease. Chromosomal 
rearrangements that result in fiision transcripts between the Mixed-Lineage Leukemia 
30 gene (MLL) and CALM (Clathrin assembly lymphoid myeloid leukemia gene) are 
believed to play a role in leukemogenesis. Similarly, disruptions in Rab7 and Rab9, 
as well as HIPl (Huntingtin-mteractmg protein), genes that are believed to be 
mvolved in endocytosis, are potentially responsible for aihnents resulting in lipid 
storage, and neuronal diseases, respectively. For these reasons, siRNA directed 
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against clathriii and other genes involved in the clathrin-mediated endocytotic 
pathway are potentially important research and therapeutic tools. 

siRNAs directed against genes involved in the clathrin-mediated endocytosis 
5 pathways were selected using Formula VIII. The targeted genes were clathrin heavy 
chain (CHC, accession # NM_004859), clathrin light chain A (CLCa, NM_001833), 
clathrin light chain B (CLCb, NM_001834), CALM (U45976), |32 subunit of AP-2 
m, NM_001282), Epsl5 (NM_001981), EpslSR (NM_021235), dynamin II 
(DYNII, NM_004945), RabSa (BC001267), RabSb (NM_O02868), RabSc 
10 (AF141304), and EEA.l (XM_018197). 

Tor each gene, four siRNAs duplexes with the highest scores were selected 
and a BLAST search was conducted for each of them using the Human EST database. 
In ordea: to minimize the potential for off-target silencing effecsts, only those 
1 5 sequences with more than three mismatches against un-related sequences were used. 
All duplexes were synthesized at Dharmacon, hic. as 21-mers with 3'-UU overhangs 
using a modified method of 2'-ACE chemistry Scaringe, Advanced 5'-silyl-2'- 
orthoester approach to RNA oligonucleotide synthesis, Methods Enzymol 2000, 317:3 
and the antisense strand was chemically phosphorylated to insure maximized activity. 

20 

HeLa cells were grown in Dulbecco's modified Eagle's medium (DMEM) 
containing 10% fetal bovine serum, antibiotics and glutamine. siRNA duplexes were 
resuspended in lX siRNA Universal buffer (Dharmacon, Inc.) to 20^iM prior to 
transfection. HeLa cells in 12-well plates were transfected twice with 4nl of 20|jM 

25 SiRNA duplex in 3jil Lipofectamine 2000 reagent (Invitrogen, Carlsbad, California, 
USA) at 24-hour intervals. For the transfections in which 2 or 3 siRNA duplexes were 
included, the amount of each duplex was decreased, so that the total amount was the 
same as in transfections with single siRNAs. Cells were plated mto normal culture 
medium 12 hours prior to experiments, and protein levels were measured 2 or 4 days 

30 after the first transfection. 



Equal amounts of lysates were resolved by electrophoresis, blotted, and 
stained with the antibody specific to targeted protein, as well as antibodies specific to 



wo 2004/045543 PCT/US2003/036787 
66 

unrelated proteins, PPl phosphatase and TsglOl (not shown). The cells were lysed in 
Triton X-lOO/glycerol solubilization buffer as described previously. Tebar, 
Bohlander, & Sorkin, Clathrin Assembly Lymphoid Myeloid Leukemia (CALM) 
Protein: Localization in Endocy tic-coated Pits, Interactions with Clathrin, and the 
5 Impact of Overexpression on Clathrin-mediated Traffic, Mol. Biol. Cell Aug 1999, 
10:2687. Cell lysates were electrophoresed, transferred to nitrocellulose membranes, 
and Western blotting was performed with several antibodies followed by detection 
using enhanced chemiluminescence system (Pierce, Inc). Several x-ray fibns were 
analyzed to determine the linear range of the chemiluminescence signals, and the 
1 0 quantifications were performed using densitometry and Alphalmager v5.5 software 
(Alpha hmotech Corporation). In experiments with EpslSR-targeted siRNAs, cell 
lysates were subjected to mununoprecipitation with Ab860, and EpslSR was detected 
in immunoprecipitates by Western blotting as described above. 



1 5 The antibodies to assess the levels of each protein by Western blot were 

obtamed from the following sources: monoclonal antibody to clathrin heavy chain 
(TD.l) was obtained from American Type Culture Collection (Rockville, MD, USA); 
polyclonal antibody to dynamin n was obtained from Affinity Bioreagents, Inc. 
(Golden, CO, USA); monoclonal antibodies to EEA.1 and RabSa were purchased 

20 from BD Transduction Laboratories (Los Angeles, CA, USA); the monoclonal 

antibody to TsglOl was purchased from Santa Cruz Biotechnology, Inc. (Santa Cruz, 
CA, USA); the monoclonal antibody to GFP was from ZYMED Laboratories Inc. 
(South San Franciscoj CAj-USA); the rabbit polyclonal antibodies Ab32 specific to a- 
adaptins and Ab20 to CALM were described previously Sorkin, et al. Stoichiometric 

25 Interaction of the Epidermal Growth Factor Receptor -with the Clathrin-associated 
Protein Complex AP-2, J. Biol. Chem. Jan 1995, 270:619, the polyclonal antibodies to 
clathrin light chains A and B were kindly provided by Dr. F. Brodsky (UCSF); 
monoclonal antibodies to PPl (BD Transduction Laboratories) and a-Actinin 
(Chemicon) were kindly provided by Dr. M. Dell'Acqua (University of Colorado); 

30 Epsl 5 Ab577 and EpslSR Ab860 were kindly provided by Dr. P.P. Di Fiore 
(European Cancer Institute). 
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Figure 11 demonstrates the in vivo fimctionality of 48 individual siRNAs, 
selected using Formula VIII (most of them will meet the criteria incorporated by 
Formula IX as well) targeting 12 genes. Various cell lines were transfected with 
siRNA duplexes {Dupl-4) or pools of siRNA duplexes (Pool), and the cells were 
5 lysed 3 days after transfection with the excq)tion of CALM (2 days) and P2 (4 days). 

Note a P 1 -adaptin band (part of AP-1 Golgi adaptor complex) that runs 
slightly slower than P2 adaptm. CALM has two splice variants, 66 and 72 kD. The 
full-length Epsl 5R (a doublet of -130 kD) and several truncated spliced forms of ~ 

10 1 00 kD and ~70 kD were detected in Epsl 5R immunoprecipitates (shown by arrows). 
The cells were lysed 3 days after transfection. Equal amounts of lysates were 
resolved by electrophoresis and blotted with the antibody specific to a targeted protein 
(GFP antibody for YFP fijsion proteins) and the antibody specific to unrelated 
proteins PPl phosphatase or a-actinin, and TSGIOI. The amount of protein in each 

1 5 specific band was normalized to the amount of non-specific proteins in each lane of 
the gel. Nearly all of them appear to be fimctional, which estabUshes that Formula 
VIII and DC can be used to predict siRNAs' fimctionality in general in a genome wide 
manner. 

20 To generate the fusion of yellow fluorescent protein (YFP) with RabSb or 

RabSc (YFP-Rab5b or YFP-Rab5c), a DNA firagment encoding the full-length human 
RabSb or RabSc was obtained by PGR using Pfu polymerase (Stratagene) witli a Sad 
restriction site introducey ihfd'the 5' end and a Kpjil site into the 3' end and cloned 
into pEYFP-Cl vector (CLONTECH, Palo Alto, CA, USA). GFP-CALM and YFP- 

25 RabSa were described previously Tebar, Bohlander, & Sorkin, Clathrin Assembly 

Lymphoid Myeloid Leukemia (CALM) Protein: Localization in Endocytic-coated Pits, 
Interactiom with Clathrin. and the Impact of Overexpression on Clathrin-mediated 
Traffic, Mol. Biol. Cell Aug 1999, 10:2687. 

30 Example in. Validation of the Algorithm Usmg Eg5, GADPH, ATEl, MEK2, 
MEKl, QB, LaminA/C, c-myc, human cyclophilin, and mouse cyclophUin. 
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A number of genes have been identified as playing potentially important roles 
in disease etiology. Expression profdes of normal and diseased kidneys has 
implicated Edg5 in immimoglobulin A neuropathy, a common renal glomerular 
disease. Mycl, lVIEKl/2 and other related kinases have been associated with one or 
5 more cancers, while lamins have been implicated in muscular dystrophy and other 
diseases. For these reasons, siRNA directed against the genes encoding these classes 
of molecules would be important research and therapeutic tools. 

Figure 12 illustrates four siRNAs targeting 10 different genes (Table V for 
10 sequence and accession number information) that were selected according to the 
Formula VIII and assayed as individuals and pools in HBK293 cells. The level of 
siRNA induced silencing was measured using the B-DNA assay. These studies 
demonstrated that thirty-six out of the forty individual SMART-selected siRNA tested 
are fimctional (90%) and alllO pools are fully fimctional. 

15 

Example Y. Validation of the Algorithm Using Bcl2 

Bcl-2 is a ~25kD, 205-239 amino add, anti-apoptotic protein that contains 
considerable homology vdth other members of the BCL family including BCLX, 
MCLl , BAX, BAD, and BIK. The protein exists in at least two forms (BcLZa, which 

20 has a hydrophobic tail for membrane anchorage, and Bcl2b, which lacks the 

hydrophobic tail) and is predominantly localized to the mitochondrial membrane. 
While Bcl2 expression is widely distributed, particular interest has focused on the 
expression of this molecule jn Biand T cells. Bcl2 expression is down-regulated in 
normal germinal center B cells yet in a high percentage of follicular lymphomas, Bcl2 

25 expression has been observed to be elevated. Cytological studies have identified a 

common translocation ((14;18)(q32;q32)) amongst a high percentage (>70%) of these 
lymphomas. This, genetic lesion places the Bcl2 gene in juxtaposition to 
immunoglobulin heavy chain gene (IgH) encoding sequences and is believed to 
enforce inappropriate levels of gene expression, and resistance to programmed cell 

30 death in the follicle center B cells. In other cases, hypomethylation of the Bcl2 

promoter leads to enhanced expression and again, inhibition of apoptosis. In addition 
to cancer, dysregulated expression of Bcl-2 has been correlated with multiple sclerosis 
and various neurological diseases. 
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The correlation between Bcl-2 translocation and cancer makes this gene an 
attractive target for RNAi. Identification of siRNA directed against the bcl2 transcript 
(or Bcl2-IgH fbsions) would further our understanding Bcl2 gene function and 
possibly provide a future therapeutic agent to battle diseases that result from altered 
5 expression or function of this gene. 

In Silico Identification of Functional siRNA 

To identify functional and hyperfunctional siRNA against the Bcl2 gene, the 
sequence for Bcl-2 was downloaded from the NCBI Unigene database and analyzed 
1 0 using the Formula VHI algorithm. As a result of these procedures, both the sequence 
and SMARTscores™ of the Bcl2 siRNA were obtained and ranked according to their 
fimctionality. Subsequently, these sequences were BLAST' ed (database) to insure that 
the selected sequences were specific and contained minimal overlap with unrealated 
genes. The SMARTscores™ for the top 10 Bcl-2 siRNA are identified in Figure 13. 

15 

In Vivo Testing of Bcl-2 SiRNA 

Bcl-2 siRNAs having the top ten SMARTscores™ were selected and tested in 
a ftmctional assay to determine silencing efficiency. To accomplish this, each of the 
ten duplexes were synthesized using 2'-0-ACE chemistry and transfected at 1 OOnM 
. 20 concentrations into cells. Twenty-four hours later assays were performed on cell 
extracts to assess the degree of target silencing. Controls used in these e}q)eriments 
included mock transfected cells, and cells that were transfected with a non-specific 
siRNA duplex. „ ^^.^^^ . 

25 The results of these experiments are presented below (and in Figure 14) and 

show that all ten of the selected siRNA induce 80% or better silencmg of the Bcl2 
message at lOOnM concentrations. These data verify that the algorithm successfully 
identified functional Bcl2 siRNA and provide a set of fimctional agents that can be 
used in experimental and therapeutic enviromnents. 
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siRNAl GGGAGAUAGUGAUGAAGUA SEQ.IDNO.301 

siRNA2 GAAGUACAUCCAUUAUAAG SEQ. IDNO.302 

siRNA3 GUACGACAACCGGGAGAUA SEQ. E) NO. 303 

siRNA4 AGAUAGUGAUGAAGUACAU SEQ. ID NO. 304 

siRNA5 UGAAGACUCUGCUCAGUUU SEQ. ID NO. 305 

siRNA6 GCAUGCGGCCUCUGUUUGA SEQ. ID NO. 306 

siRNA7 UGCGGCCUCUGUUUGAUUU SEQ. ID NO. 307 

siRNAS GAGAUAGUGAUGAAGUACA SEQ. ID NO. 308 

siRNA9 GGAGAUAGUGAUGAAGUAC SEQ. ID NO. 309 

siRNA 10 GAAGACUCUGCUCAGUUUG SEQ. ID NO. 3 10 

Bcl2 siKNA: Sense Strand, 5'->3' 



Example VI. Sequences Selected by the Algorithm 

Sequences of the siRNAs selected using Fonnulas (Algorithms) VIII and IX 
1 5 with their corresponding ranking, which have been evaluated for the silencing activity 
in vivo in the present study (Formula VIII and DC, respectively). 



TABLE V 



Gene 


Accession 






Fonnula 


Formula 


Name 


Number 


SEQ. ID NO. 


FTllSeqTence 


VIII 


IX 


CLTC 


MM_004859 


SEQ. ID NO. 0301 


GAAAGAATCTGTAGAGAAA 


76 


94.2 


CLTC 


NM_004859 


SEQ. ID NO. 0302 


GCAATGAGCTGTTTGAAGA 


65 


39.9 


CLTC 


TSIM_004859 


SEQ. ID NO. 0303 


TGACAAAGGTGGATAAATT 


57 


38.2 


CLTC 


NM_004859 


SEQ. ID NO. 0304 


GGAAATGGATCTCmGAA 


54 


49.4 


CLTA 


NM_001833 


SEQ. ID NO. 0305 


GGAAAGTAATGGTCCAACA 


22 


55.5 


CLTA 


NM_001833 


SEQ. ID NO. 0306 


AGACAGTTATGCAGCTATT 


4 


22.9 


CLTA 


NM_001833 


SEQ. ID NO. 0307 


CCAATTCTCGGAAGCAAGA 


1 


17 


CLTA 


NM_001833 


SEQ. ID NO. 0308 


GAAAGTAATGGTCCAACAG 


-1 


-13 


CLTB 


NM_001834 


SEQ. ID NO. 0309 


GCGCCAGAGTGAACAAGTA 


17 


57.5 


CLTB 


NM_001834 


SEQ. ID NO. 0310 


GAAGGTGGCCCAGCTATGT 


15 


-8.6 


CLTB 


NM_001834 


SEQ. ID NO. 0311 


GGAACCAGCGCCAGAGTGA 


13 


40.5 


CLTB 


NM_001834 


SEQ. ED NO. 0312 


GAGCGAGATTGCAGGCATA 


20 


61.7 


CALM 


U45976 


SEQ:IDN0.0313 


GTTAGTATCTGATGACTTG 


36 


-34.6 


CALM 


U45976 


SEQ. ID NO. 03 14 


GAAATGGAACCACTAAGAA 


33 


46.1 


CALM 


U45976 


SEQ. ID NO. 0315 


GGAAATGGAACCACTAAGA 


30 


61.2 


CALM 


U45976 


SEQ. ID NO. 0316 


CAACTACACTTTCCAATGC 


28 


6.8 


EPS15 


NM_001981 


SEQ. ID NO. 0317 


CCACCAAGATTTCATGATA 


48 


25.2 
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EPS15 


NM_001981 


SEQ.IDNO.0318 


EPS15 


NM_001981 


SEQ.IDNO.0319 


EPS 1 5 


NM_001981 


SEQ.IDNO. 0320 


EPS15R 


NM_021235 


SEQ.IDNO. 0321 


EPS15R 


NM_021235 


SEQ. ID NO. 0322 


EPS15R 


NM_021235 


SEQ.IDNO. 0323 


EPS15R 


NM_021235 


SEQ. ID NO. 0324 


DNM2 


NM_004945 


SEQ. ID NO. 0325 


DNM2 


NM_004945 


SEQ. ED NO. 0326 


DNM2 


NM_004945 


SEQ. ID NO. 0327 


DNM2 


NM_004945 


SEQ. ID NO. 0328 


ARF6 


AF93885 


SEQ. ID NO. 0329 


ARF6 


AF93885 


SEQ. ID NO. 0330 


ARF6 


AF93885 


SEQ. .IP NO. 0331 


ARF6 


AF93885 


SEQ. ID NO. 0332 


RAB5A 


BC001267 


SEQ. ID NO. 0333 


RAB5A 


BC001267 


SEQ. ID NO. 0334 


RAB5A 


BC001267 


SEQ. ID NO. 0335 


RAB5A 


BC001267 


SEQ.IDNO. 0336 


RAB5B 


NM_002868 


SEQ.IDNO. 0337 


RAB5B 


NM_002868 


SEQ. ID NO. 0338 


RAB5B 


NM_002868 


SEQ.IDNO. 0339 


RAB5B 


NM_002868 


SEQ. ID NO. 0340 


RAB5C 


AF141304 


SEQ. ID NO. 0341 


RAB5C 


AF141304 


SEQ. ID NO. 0342 


RAB5C 


AF141304 


SEQ. ID NO. 0343 


RAB5C 


AF141304 


SEQ. ID NO. 0344 


EEAl 


XM_018197 


SEQ. ID NO. 0345 


EEAl 


XM_018197 


SEQ. ID NO. 0346 


EEAl 


XM_018197 


SEQ. ID NO. 0347 


EEAl 


XM_018197 


SEQ. ID NO. 0348 


AP2B1 


NM_001282 


SEQ. ID NO. 0349 


AP2B1 


NM_001282 


SEQ. ID NO. 0350 


AP2B1 


NM_001282 


SEQ. ID NO. 0351 


AP2B1 


NM_001282 


SEQ. ID NO. 0352 


PLK 


NM_005030 


SEQ. ID NO. 0353 


PLK 


NM_005030 


SEQ. ID NO. 0354 


PLK 


NM_005030 


SEQ. ID NO. 0355 


PLK 


NM_005030 


SEQ. ID NO. 0356 


GAPDH 


NM_002046 


SEQ. ID NO. 0357 
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GATCGGAACTCCAACAAGA 


43 


49.3 


AAACGGAGCTACAGATTAT 


39 


11.5 


CCACACAGCATTCTTGTAA 


33 


-23.6 


GAAGTTACCTTGAGCAATC 


48 


33 


GGACTTGGCCGATCCAGAA 


27 


33 


GCACTTGGATCGAGATGAG 


20 


1.3 


CAAAGACCAATTCGCGTTA 


17 


27.7 


CCGAATCAATCGCATCTTC 


6 


-29.6 


GACATGATCCTGCAGTTCA 


5 


-14 


GAGCGAATCGTCACCACTT 


5 


24 


CCTCCGAGCTGGCGTCTAC 


-4 


-63.6 


TCACATGGTTAACCTCTAA 


27 


-21.1 


GATGAGGGACGCCATAATC 


7 


-38.4 


CCTCTAACTACAAATCITA 


4 


16.9 


GGAAGGTGCTATCCAAAAT 


4 


11.5 


GCAAGCAAGTCCTAACATT 


40 


25.1 


GGAAGAGGAGTAGACCTTA 


17 


50.1 


AGGAATCAGTGTTGTAGTA 


16 


11.5 


GAAGAGGAGTAGACCTTAC 


12 


7 


GAAAGTCAAGCCTGGTATT 


14 


18.1 


AAAGTCAAGCCTGGTATTA 


6 


-17.8 


GCTATGAACGTGAATGATC 


3 


-21.1 


CAAGCCTGGTATTACGTTT 


-7 


-37.5 


GGAACAAGATCTGTCAATT 


38 


51.9, 


GCAATGAACGTGAACGAAA 


29 


43.7 


CAATGAACGTGAACGAAAT ■ 


18 


43.3 


GGACAGGAGCGGTATCACA 


6 


18.2 


"AdACAGAGCTTGAGAATAA 


67 


64.1 


GAGAAGATCTTTATGCAAA 


60 


48.7 


GAAGAGAAATCAGCAGATA 


58 


45.7 


GCAAGTAACTCAACTAACA 


56 


72.3 


GAGCTAATCTGCCACATTG 


49 


-12.4 


GCAGATGAGTTACTAGAAA 


44 


48.9 


CAACTTAATTGTCCAGAAA 


41 


28.2 


CAACACAGGATTCTGATAA 


33 


-5.8 


AGATTGTGCCTAAGTCTCT 


-35 


-3.4 


ATGAAGATCTGGAGGTGAA 


0 


^.3 


TTTGAGACTTCTTGCCTAA 


-5 


-27.7 


AGATCACCCTCCTTAAATA 


15 


72.3 


CAACGGATTTGGTCGTATT 


27 


-2.8 
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GAPDH 


NM_002046 


SEQ.ro NO. 0358 


GAPDH 


]SIM_002046 


SEQ.ro NO. 0359 


GAPDH 


NM_002046 


SEQ.ro NO. 0360 


c-Myc 




SEQ.ro NO. 0361 


c-Myc 




SEQ.ro NO. 0362 


c-Myc 




SEQ. ro NO. 0363 


c-Myc 




SEQ.ro NO. 0364 


MAP2K1 


NM_002755 


SEQ.ro NO. 0365 


MAP2K1 


NM_002755 


SEQ.ro NO. 0366 


MAP2K1 


>}M_002755 


SEQ.ro NO. 0367 


MAP2K1 


NM_002755 


SEQ.ro NO. 0368 


MAP2K2 


NM_030662 


SEQ.ro NO. 0369 


MAP2K2 


NM_030662 


SEQ.ro NO. 0370 


MAP2K2 


NM_030662 


SEQ. ID NO. 0371 


MAP2K2 


NM_030662 


SEQ.ro NO. 0372 


KNSL1(EG5) 


NM_004523 


SEQ.ro NO. 0373 


KNSL1(EG5) 


NM_004523 


SEQ.ro NO. 0374 


KNSL1(EG5) 


NM_004523 


SEQ.ro NO. 0375 


KNSL1(EG5) 


NM_004523 


SEQ.ro NO. 0376 


CyclophilinA_ 


NM_021130 


SEQ.ro NO. 0377 


CyclophilinA_ 


NM_021130 


SEQ.ro NO. 0378 


CyclophilinA_ 


NM_021130 


SEQ.ro NO. 0379 


CyclophilinA_ 


NM_021130 


SEQ.ro NO. 0380 


LAMINA/C 


NM_170707 


SEQ.ro NO. 0381 


LAMINA/C 


NM_170707 


SEQ.ro NO. 0382 


LAMINA/C 


NM_170707 


SEQ.ro NO. 0383 


LAMINA/C 


NM_1 70707 


SEQ.ro NO. 0384 


CyclophilinB 


M60857 


SEQ.ro NO. 0385 


CyclophilinB 


M60857 


SEQ.ro NO. 0386 


CyclophilinB 


M60857 


SEQ.ro NO. 0387 


CyclophilinB 


M60857 


SEQ.ro NO. 0388 


DBll 


NM_020548 


SEQ.ro NO. 0389 


DBI2 


NM_020548 


SEQ.ro NO. 0390 


DBI3 


NM_020548 


SEQ.ro NO. 0391 


DBI4 


NM_020548 


SEQ.ro NO. 0392 


rLUCl 




SEQ.ro NO. 0393 


rLUC2 




SEQ.ro NO. 0394 


rLUC3 




SEQ.ro NO. 0395 


rLUC4 




SEQ.ro NO. 0396 


SeAPl 


NM_031313 


SEQ.ro NO. 0397 
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GAAATCCCATCACCATCTT 


24 


3.9 


GACCTCAACTACATGGTTT 


22 


-22.9 


TGGTTTACATGTTCCAATA 


9 


9.8 


GAAGAAATCGATGTTGTTT 


31 


-11.7 


ACACAAACTTGAACAGCTA 


22 


51.3 


GGAAGAAATCGATGTTGTT 


18 


26 


GAAACGACGAGAACAGTTG 


18 


-8.9 


GCACATGGATGGAGGTTCT 


26 


16 


GCAGAGAGAGCAGATTTGA 


16 


0.4 


GAGGTTCTCTGGATCAAGT 


14 


15.5 


GAGCAGATTTGAAGCAACT 


14 


18.5 


CAAAGACGATGAC1TCGAA 


37 


26.4 


GATCAGCATTTGCATGGAA 


24 


-0.7 


TCCAGGAGTTTGTCAATAA 


17 


-4.5 


GGAAGCTGATCCACCTTGA 


16 


59.2 


GCAGAAATCTAAGGATATA 


53 


35.8 


CAACAAGGATGAAGTCTAT 


50 


18.3 


CAGCAGAAATCTAAGGATA 


41 


32.7 


CTAGATGGCTTTCTCAGTA 


39 


3.9 


AGACAAGGTCCCAAAGACA 


-16 


58.1 


GGAATGGCAAGACCAGCAA 


-6 


36 


AGAATTATTCCAGGGITTA 


-3 


16.1 


. GCAGACAAGGTCCCAAAGA 


8 


8.9 


AGAAGCAGCTTCAGGATGA 


31 


38.8 


GAGCTTGACTTCCAGAAGA 


33 


22.4 


CCACCGAAGTTCACCCTAA 


21 


27.5 


GAGAAGAGCTCCTCCATCA 


55 


30.1 


GAAAGAGCATCTACGGTGA 


41 


83.9 


GAAAGGATTTGGCTACAAA 


53 


59.1 


acagcaaattccatcgtgt 


-20 


28.8 


GGAAAGACTGTTCCAAAAA 


2 


27 


CAACACGCCTCATCCTCTA 


27 


-7.6 


CATGAAAGCTTACATCAAC 


25' 


-30.8 


AAGATGCCATGAAAGCTTA 


17 


22 


GCACATACCGCCTGAGTCT 


15 


3.9 


GATCAAATCTGAAGAAGGA 


57 


49.2 


GCCAAGAAGTTTCCTAATA 


50 


13.7 


CAGCATATCTTGAACCATT 


41 


-2.2 


GAACAAAGGAAACGGATGA 


39 


29.2 


CGGAAACGGTCCAGGCTAT 


6 


26.9 
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SeAP2 


NM_031313 


SEQ.IDNO.0398 


SeAP3 


NM_031313 


SEQ.IDNO.0399 


SeAP4 


NM_031313 


SEQ. ED NO. 0400 


fLUCl 




SEQ.IDNO.0401 


fLUC2 




SEQ. ID NO. 0402 


£LUC3 




SEQ. ID NO. 0403 


£LUC4 




SEQ. ID NO. 0404 


mCyclo_l 


NM_008907 


SEQ. ID NO. 0405 


mCyclo_2 


NM_0O8907 


SEQ. ID NO. 0406 


mCyclo_3 


NM_008907 


SEQ. ID NO. 0407 


mCyclo_4 


NM_008907 


SEQ. ID NO. 0408 


BCL2_1 


MM_000633 


SEQ. ID NO. 0409 


BCL2_2 


NM_000633 


SEQ. ID NO. 0410 


BCL2_3 


NM_000633 


SEQ.IDNO.04J1 


BCL2_4 


NM_000633 


SEQ. ID NO. 0412 


BCL2_5 


NM_000633 


SEQ. ID NO. 0413 


BCL2_6 


NM_000533 


SEQ. ID NO. 0414 


QBl 


NM_003365.1 


SEQ. ID NO. 0415 


QB2 


NM_003365.1 


SEQ. ID NO. 0416 


QB3 


NM_003365.1 


SEQ. ID NO. 0417 


QB4 


NM_003365.1 


SEQ. ID NO. 0418 


ATEl-1 


NM_007041 


SEQ. ID NO. 0419 


ATEl-2 


NM_007041 


SEQ. ID NO. 0420 


ATEl-3 


.- . ■ NM_007041 


SEQ. ID NO. 0421 


ATEl-4 


NM_00704i 


SEQ. ID NO. 0422 


EGFR-1 


NM_005228 


SEQ. ID NO. 0423 


EGFR-1 


NM_005228 


SEQ. ID NO. 0424 


EGFR-1 


NM_005228 


SEQ. ID NO. 0425 


EGFR-1 


NM_005228 


SEQ. ID NO. 0426 
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GCTTCGAGCAGACATGATA 


4 


-11.2 


CCTACACGGTCCTCCTATA 


4 


4.9 


GCCAAGAACCTCATCATCT 


1 


-9.9 


GATATGGGCTGAATACAAA 


54 


40.4 


GCACTCTGATTGACAAATA 


47 


54.7 


TGAAGTCTCTGATTAAGTA 


46 


34.5 


TCAGAGAGATCCTCATAAA 


40 


11.4 


GCAAGAAGATCACCATTTC 


52 


46.4 


GAGAGAAATTTGAGGATGA 


36 


70.7 


GAAAGGATTTGGCTATAAG 


35 


-1.5 


GAAAGAAGGCATGAACATT 


27 


10.3 


GGGAGATAGTGATGAAGTA 


21 


72 


GAAGTACATCCATTATAAG 


1 


3.3 


GTACGACAACCGGGAGATA 


1 


35.9 


AGATAGTGATGAAGTACAT 


-12 


22.1 


TGAAGACTCTGCTCAGTTT 


36 


19.1 


GCATGCGGCCTCTGTTTGA 


5 


-9.7 


GCACACAGCUUACUACAUC 


52 


-4.8 


GAAAUGCCCUGGUAUCUCA 


49 


22.1 


GAAGGAACGUGAUGUGAUC 


34 


22.9 


GCACUACUCCUGUGUGUGA 


28 


20.4 


GAACCCAGCUGGAGAACUU 


45 


15.5 


GAUAUACAGUGUGAUCUUA 


40 


12.2 


GUACUACGAUCCUGAUUAU 


37 


32.9 


GUGCCGACCUUUACAAUUU 


35 


18.2 


GAAGGAAACTGAATTCAAA 


68 


79.4 


GGAAATATGTACTACGAAA 


49 


49.5 


CCACAAAGCAGTGAATTTA 


41 


7.6 


GTAACAAGCTCACGCAGTT 


40 


25.9 



Example VII. Genome-Wide Application of the Algorithm 

The examples described above demonstrate that the algorifhm(s) can 
5 successfully identify functional siKNA and that these duplexes can be used to induce 
the desirable phenotype of transcriptional knockdown or knockout. Each gene or 
family of genes in each organism plays an important role in maintaining physiological 
homeostasis and the algorithm can be used to develop functional, highly functional, or 
hyperfunctional siKNA to each gene. To accomplish this for the human genome, the 
10 entire online ncbi refseq database was accessed through Entrez (efetch). The database 
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was processed through Formula VIII. For each gene the top 80 -100 scores for 
siRNAs were obtained and BLAST'ed to insure that the selected sequences are 
specific in targeting the gene of choice. These sequences are provided on the 
enclosed CDs in electronic form. Accordingly, Applicants hereby incorporate by 

5 reference the material submitted herewith, in duplicate on the compact disks labeled 
COPY 1 - TABLES PART, DISK 1/1, TABLES 12 -15, Filed with ROAJS under 
PCT AI sec. 801(a), Operating System: MS-Windows, COPY 2 - TABLES PART, 
DISK 1/1, TABLES 12-15, Filed with RO/US under PCT AI sec. 801(a), Operating 
System: MS-Windows, COPY 3 - TABLES PART, DISK 1/1, TABLES 12 -15, 

10 Filed with RO/US under PCT AI sec. 801(a), Operating System: MS-Windows,; 
which copies are identical, in files entitled Table_12.titt, date of creation June 26, 
2003, with a size of 31,045 kb; Table_13.txt, date of creation November 13, 2003, 
with a size of 78,451 kb; Table_14.txt, date of creation November 13, 2003, with a 
size of 454 kb; and Table_15.txt date of creation November 13, 2003, with a size of 

15 1,690 kb. 

With respect to the disks, there are four tables on each disk copy in text 
format: Tables XII -XV. Table XII, which is located in a file entitled Table_12.txt, 
provides a list of the 80-100 sequences for each target, identified by Formula VIII as 

,20 having the highest relative SMARTscores™ for the target analyzed. Table XIII, 

which is located in a file entitled Table l 3 .txt, provides the SMARTscores™, and for 
each gene, a pool pick of up to four sequences is denoted. (The denotation of "1" in 
Table XIII means that it is a pool pick.) These pool pick sequences represent the most 
functional siRNAs for the corresponding target. Any 1, 2, 3, or 4 of the pool pick 

25 sequences could be used for gene silencing. Further, sequences that are not denoted 
as pool pick sequences, but that are included on the compact disks may also be used 
for gene silencing either alone or in combination with other sequences. However, 
their individual relative fimctionality would be less than that of a pool pick sequence. 
Table XTV, which is located in a file entitled Table_14.txt, provides an identification 

30 of genes by accession number, and Table XV, which is located in a file entitled 
Table_15.txt, provides a short name for the genes identified on the disk. The 
information contained on the disks is part of this patent application and are 
incorporated into the specification by reference. One may use these tables in order to 
identify ftinctional siRNAs for the gene provided therein, by sunply looking for the 



wo 2004/045543 PCT/US2003/036787 
75 

gene of interest and an siRNA that is listed as fiinctional. Preferably, one would 
select one or more of the siRNA that most optimized for the target of interest and is 
denoted as a pool pick. 



5 Table XII: siRNA selected by Formula VIII 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 



Table Xni: SMARTscores™ 
10 See data submitted herewith on a CD-ROM in accordance with PCT 

Administrative Instructions Section 801(a) 



Table XIV: Identification of Targets 

See data submitted herewith on a CD-ROM in accordance with PCT 
15 Administrative Instructions Section 801(a) 

Table XV: Description of Targts 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions JSection 801(a) 

Many of the genes to which the described siRNA are directed play critical 
roles in disease etiology. For this reason, the siRNA listed in the accompanying 
compact disk may potentially act as therapeutic agents.. A. number of prophetic 
examples follow and should be understood in view of the siRNA that are identified on 
the accompanying CD. To isolate these siRNA, the appropriate message sequence for 
each gene is analyzed using one of the before mentioned formulas (preferably formula 
VIII) to identify potential siRNA targets. Subsequently these targets are BLAST'ed to 
eliminate homology with potentially off-targets. 

30 The list of potential disease targets is extensive. For instance, over-expression 

of Bel 10 has been implicated in tiie development of MALT lymphoma (mucosa 
associated lymphoid tissue lymphoma) and thus, functional, highly functional, or 
hyperfimctional siKNA directed against that gene (e.^. SEQ. ID NO. 0427: 
GGAAACCUCUCAUUGCUAA; SEQ. ID NO. 0428: 



' 20 



25 
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GAAAGAACCUUGCCGAUCA; SEQ. IDNO. 0429: 
GGAAAUACAUCAGAGCUUA, or SEQ. ID NO. 0430: 
GAAAGUAUGUGUCUUAAGU) may contribute to treatment of this disorder. 

5 In another example, studies have shown that molecules that inhibit 

glutamine:fiructose-6-phosphate aminotransferase (GFA) may act to limit the 
symptoms suffered by Type II diabetics. Thus, functional, highly functional, or 
hyperfiinctional siKNTA directed against GFA (also known as GFPTl : siRNA = SEQ. 
ID NO. 0433 UGAAACGGCUGCCUGAUUU; SEQ. ID NO. 0434 
10 GAAGUUACCUCUUACAUUU; SEQ. ID NO. 0435 
GUACGAAACUGUAUGAUUA; SEQ. ID NO. 0436 

GGACGAGGCUAUCAUUAUG) may contribute to treatment of this disorder. 

In another example, the von Hippel-Lindau (VHL) tumor suppressor has been 

1 5 observed to be inactivated at a high frequency in sporadic clear cell renal cell 
carcinoma (RCC) and RCCs associated with VHL disease. The VHL tumor 
suppressor targets hypoxia-inducible factor-1 alpha (HEP-l alpha), a transcription 
factor that can induce vascular endothelial growth factor (VEGF) expression, for 
ubiquitination and degradation. Inactivation of VHL can lead to increased levels of 

20 HIF-1 alpha, and subsequent VEGF over expression. Such over expression of VEGF 
has been used to explain the increiased (and possibly necessary) vascularity observed 
in RCC. Thus, functional, highly functional, or hyperfunctional siRNAs directed 
against either HIF-1 alpha (SEQ. ID NO. 0437 GAAGGAACGPjAUGCUUUA; 
SEQ. ID NO. 0438 GCAUAUAUCUAGAAGGUAU; SEQ. IDNO. 0439 

25 GAACAAAUACAUGGGAUUA; SEQ. ID NO. 0440 

GGACACAGAUUUAGACUUG) or VEGF (SEQ. ID NO. 0441 
GAACGUACUUGCAGAUGUG; SEQ. ID NO. 0442 
GAGAAAGCAUUUGUUUGUA; SEQ. ID NO. 0443 
GGAGAAAGCAUUUGUUUGU; SEQ. ID NO. 0444 

30 CGAGGCAGCUUGAGUUAAA) maybe useful in the treatment of renal cell 
carcinoma. 

In another example, gene expression of platelet derived growth factor A and B 
(PDGF-A and PDGF-B) has been observed to be increased 22- and 6-fold, 
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respectively, in renal tissues taken from patients with diabetic nephropathy as 
compared with controls. These findings suggest that over expression of PDGF A and 
B may play a role in the development of the progressive fibrosis that characterizes 
human diabetic kidney disease. Thus, functional, highly functional, or hyperfunctional 
5 siRNAs directed against either PDGF A 

(SEQ. ID NO. 0445: GGUAAGAUAUUGUGCUUUA; 
SEQ. ID NO. 0446: CCGCAAAUAUGCAGAAUUA; 
SEQ. m NO. 0447: GGAUGUACAUGGCGUGUUA; 
SEQ. ID NO. 0448: GGUGAAGUUUGUAUGUUUA) or 

10 

PDGFB 

(SEQ. ID NO. 0449: CCGAGGAGCUUUAUGAGAU- 
SEQ. ID NO. 0450: GCUCCGCGCUUUCCGAUUU; 
SEQ. ID NO. 0451 GAGCAGGAAUGGUGAGAUG; 
1 5 SEQ. ID NO. 0452: GAACUUGGGAUAAGAGUGU; 
SEQ. ID NO. 0453 CCGAGGAGCUUUAUGAGAU; 

SEQ. ID NO. 0454 UUUAUGAGAUGCUGAGUGA) may be useful in the treatment 
of this fonn of kidney disorder. 

20 In another example, a strong correlation exists between the over-expression of 

glucose bransporters (e.g. GLUT12) and cancer cells. It is predicted that cells 
undergoing uncontrolled cell growth up-regulate GLUT molecules so that they can 
cope with the heightened energy needs associated with increased.tates.of proliferation 
and metastasis. Thus, siRNA-based therapies that target the molecules such as 

25 GLUTl (also known as SLC2A1 : siRNA= 

SEQ. ID NO.: 0455 GCAAUGAUGUCCAGAAGAA; 
SEQ. ID NO.: 0456 GAAGAAUAUUCAGGACUUA; 
SEQ. ID NO.: 0457 GAAGAGAGUCGGCAGAUGA; 
SEQ. ID NO.: 0458 CCAAGAGUGUGCUAAAGAA) 

30 

GLUT12 (also known as SLCA12: siRNA = 
SEQ. ID NO. 0459: GAGACACUCUGAAAUGAUA; 
SEQ. ID NO. 0460: GAAAUGAUGUGGAUAAGAG; 
SEQ. ID NO. 0461 : GAUCAAAUCCUCCOJGAAA; 
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SEQ. ID NO. 0462: UGAAUGAGCUGAUGAUUGU) and other related transporters, 
may be of value in treating a multitude of malignancies. 

The siRNA sequences listed above are presented in a 5'^ 3' sense strand 
i direction. In addition, siRNA directed against the targets listed above as well as those 
directed against other targets and listed in the accompanying compact disk may be 
useful as therapeutic agents. 

Example VIII. Evidence for the Benefits of Pooling 
I Evidence for the benefits of pooling have been demonstrated using the 

reporter gene, luciferase. Ninety siRNA duplexes were synthesized using Dharmacon 
proprietary ACE® chemistry against one of the standard reporter genes: firefly 
luciferase. The duplexes were designed to start two base pairs apart and to cover 
approximately 1 80 base pairs of the luciferase gene (see sequences in Table III). 
Subsequently, the siRNA duplexes were co-transfected with a luciferase expression 
reporter plasmid into HEK293 cells using standard transfection protocols and 
luciferase activity was assayed at 24 and 48 hours. 

Transfection of individual siRNAs showed standard distribution of inhibitory 
effect. Some duplexes wene^ctive, while others were not. Figure 15 represents a 
typical screen of ninety siRNA duplexes (SEQ. ID NO. 0032- 0120) positioned two 
base pairs apart. As the figure suggests, the fimctionaUty of the siRNA duplex is 
determined more by a particular sequence of the oHgonucleotide than bjOharelative 
oligonucleotide position within a gene or excessively sensitive part of the mRNA, 
which is important for tiraditional anti-s«aise technology. 

When two continuous oligonucleotides were pooled together, a significant 
increase in gene silencing activity was observed. (See Figure 16) A gradual increase 
in efBcacy and the frequency of pools fiinctionality was observed when the number of 
SiRNAs increased to 3 and 4. (Figures 16, 17). Further, the relative positioning of 
the oligonucleotides within a pool did not detennine whether a particular pool was 
fimctional (see Figure 18, in which 100% of pools of oligonucleotides distanced by 2, 
1 0 and 20 base pairs were fiinctional). 
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However, relative positioning may nonetheless have an impact. An increased 
functionality may exist when the siRNA are positioned continuously head to toe (5' 
end of one directly adjacent to the 3' end of the others). 

5 Additionally, siRNA pools that were tested performed at least as well as the 

best oligonucleotide in the pool, under the experimental conditions whose results are 
depicted in Figure 19. Moreover, when previously identified non-functional and 
marginally (semi) iiinctional siRNA duplexes were pooled together in groups of five 
at a time, a significant fimctional cooperative action was observed. (See Figure 20) 
1 0 In fact, pools of semi-active oligonucleotides were 5 to 25 times more functional than 
the most potent oligonucleotide in the pool. Therefore, pooling several siRNA 
duplexes together does not interfere with the fimctionality of the most potent siRNAs 
within a pool, and pooling provides an unexpected significant increase in overall 
fimctionality 

15 

Example IX. Pooling Across Species 

Experiments were performed on the following genes: P-galactosidase, Renilla 
luciferase, and Secreted alkaline phosphatase, which demonstrates the benefits of 
pooling, (see Figure 21) Approximately 50% of individual siRNAs designed to 
20 silence the above-specified geneS^were fimctional, while 100% of the pools that 
contain the same siRNA duplexes were fimctional. 

Example X. Highly Functional siRNA si^i^;*. 

Pools of five siRNAs in which each two siRNAs overlap to 10-90% resulted 

25 in 98% fimctional entities (>80% silencing). Pools of siRNAs distributed throughout 
the mRNA that were evenly spaced, covering an approximate 20 - 2000 base pair 
range, were also fijnctional. When the pools of siRNA were positioned continuously 
head to tail relative to mRNA sequences and mimicked the natural products of Dicer 
cleaved long double stranded RNA, 98% of the pools evidenced Wghly fimctional 

30 activity (>95% silencing). 



Example XI. Human cyclophyline 
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Table III above lists the siRNA sequences for the human cyclophyline 
protein. A particularly functional siRNA may be selected by applying these 
sequences to any of Formula I to VII above. 

5 Alternatively, one could pool 2, 3, 4, 5 or more of these sequences to create a 

kit for silencing a gene. Preferably, within the kit there would be at least one 
sequence that has a relatively high predicted functionality when any of Formulas I - 
VII is apphed. 

1 0 Example XII. Sample Pools of siKNAs and Their Application to Human Disease 
The genetic basis behind human disease is well documented and siRNA may 
be used as both research or diagnostic tools and therapeutic agents,- either individually 
or in pools. Genes involved in signal transduction, the immune response, apoptosis, 
DNA repair, cell cycle control, and a variety of other physiological ftmctions have 

1 5 clinical relevance and therapeutic agents that can modulate expression of these genes 
may alleviate some or all of the associated symptoms. In some instances, these genes 
can be described as a member of a family or class of genes and siRNA (randomly, 
conventionally, or rationally designed) can be directed against one or multiple 
members of the family to induce a desired result. 

20 

To identify rationally designed siRNA to each gene, the sequence was 
analyzed using Formula VIII to identify a SMARTpool containing the functional 
sequences. To confirm the activity of these sequences, the siRNA are introduced into 
a cell type of choice (e.g. HeLa cells, HEK293 cells) and the levels of the appropriate 

25 message are analyzed using one of several art proven techniques. SiRNA having 

heightened levels of potency can be identified by testing each of the before mentioned 
duplexes at increasingly limiting concentrations. Similarly, siRNA having increased 
levels of longevity can be identified by introducing each duplex into cells and testing 
functionality at 24, 48, 72, 96, 120, 144, 168, and 192 hours after transfection. Agents 

30 that induce >95% silencing at sub-nanomolar concentrations and/or induce functional 
levels of silencing for >96 hours are considered hyperfunctional. 

The following are non-limiting examples of families of proteins to which 
siRNA described in this document are targeted against: 
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Transporters, Pumps, and Channels 

5 Transporters, pumps, and channels represent one class of genes that are 

attractive targets for siRNAs. One major class of transporter molecules are the ATP- 
binding cassette (ABC) transporters. To date, nearly 50 human ABC-transporter 
genes have been characterized and have been shown to be involved in a variety of 
physiological functions including transport of bile salts, nucleosides, chloride ions, 

10 cholesterol, toxins, and more. Predominant among this group are MDRl (which 
encodes the P-glycoprotein, NP_000918), the MDR-related proteins (MRP 1-7), and 
the breast cancer resistance protein (BCRP). hi general, these transporters share a 
common structure, with each protein containing a pair of ATP-binding domains (also 
known as nucleotide binding folds, NBF) and two sets of transmembrane (TM) 

15 domains, each of which typically contains six membrane-spanning a-helices. The 
genes encoding this dass of transporter are organized as either full transporters (i.e. 
containing two TM and two NBF domains) or as half transporters that assemble as 
either homodimers or heterodimers to create functional transporters. As a whole, 
members of the family are widely dispersed throughout the genome and show a high 

20 degree of amino acid sequence identify among eukaryotes. 

ABC-transporters have been implicated in several human diseases. For 
instance, molecular efHux pumps of this type play a major role in the development of 
drug resistance exhibited by a variety ofcancers and pathogenic microorganisms. In 

25 the case of human cancers, increased expression of the MDRl gene and related pumps 
have been observed to generate drug resistance to a broad collection of commonly 
used chemotherapeutics including doxorubicin, daunorubicin, vinblastine, vincristine, 
colchicines. In addition to the contribution these transporters make to the 
development of multi-drug resistance, there are currently 13 human genetic diseases 

30 associated with defects in 1 4 different transporters. The most common of these 

conditions include cystic fibrosis, Stargardt disease, age-related macular degeneration, 
adrenoleukodystrophy, Tangier disease, Dubin-Johnson syndrome and progressive 
familial intrahepatic cholestasis. For this reason, siRNAs directed against members of 
this, and related, famiUes are potentially valuable research and therapeutic tools. 
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With respect to channels, analysis of Drosophila mutants has enabled the 
initial molecular isolation and characterization of several distinct channels including 
(but not limited to) potassium (K+) channels. This list includes shaker (Sh), which 
encodes a voltage activated K"" channel, slowpoke (Slo), a Ca^"^ activated K"^ channel, 
5 and ether-a-go-go (Eag). The Eag family is further divided into three subfamilies: 
Eag, Elk (eag-like K channels), and Erg (Eag related genes). 

The Erg subfamily contains three separate family members (Ergl-3) that are 
distantly related to the sh family of voltage activated channels. Like sh, erg 

1 0 polypetides contain the classic six membrane spanning architecture of channels 
(S 1 -S6) but differ in that each includes a segment associated with the C-terminal 
cytoplasmic region that is homologous to cyclic nucleotide binding domains (cNBD). 
Like many isolated ion channel mutants, erg mutants are tempa-ature-sensitive 
paralytics, a phenotype caused by spontaneous repetitive firing (hyperactivity) in 

1 5 neurons and enhanced transmitter release at the neuromuscular junction. 

Initial studies on the tissue distribution of all three members of the erg 
subfamily show two general patterns of expression. Ergl and erg3 are broadly 
expressed throughout the nervous system and are observed in the heart, the superior 

20 mesenteric ganglia, the celiac ganglia, the retina, and the brain. In contrast, CTg2 
shows a much more restricted pattern of expression and is only observed in celiac 
ganglia and superior mesenteric ganglia. Similarly, the kinetic properties of the three 
erg potassium chaimels are not homogeneous. Ergl and erg2 channels are relatively 
slow activating delayed rectifiers whereas the erg3 current activates rapidly and then 

25 exhibits a predominantly transient component that decays to a sustained plateau. The 
current properties of all three channels are sensitive to methanesulfonanihdes, 
suggesting a high degree of conservation in the pore structure of all three proteins. 

Recently, the erg family of K"" channels has been imphcated in human disease. 
30 Consistent with the observation that ergl is expressed in the heart, single strand 
conformation polymorphism and DNA sequence analyses have identified HERG 
(human ergl) mutations in six long-QT-syndrome (LQT) families, an inherited 
disorder that results in sudden death firom a ventricular tachyarrythmia. Thus siRNA 
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directed against this group of molecules (g.g. KCNHl-8) will be of extreme 

therapeutic value. 



Another group of channels that are potential targets of siRNAs are 
5 the CLCA family that mediate a Ca^"^-activated CP conductance in a variety of 
tissues. To date, two bovine (bCLCl; bCLCA2 (Lu-ECAM-1)), three mouse 
(mCLCAl; mCLCA2; mCLCA3) and four human (hCLCAl; hCLCA2; hCLCAS; 
hCLCA4) CLCA family members have been isolated and patch-clamp studies with 
transfected human embryonic kidney (HEK-293) cells have shown that bCLCAl, 
1 0 mCLCAl , and hCLCAl mediate a Ca^*-activated CP conductance that can be 
inhibited by the anion channel blocker DIDS and the reducing agent dithiotiireitol 
(DTT). 

The protein size, structure, and processing seem to be similar among different 
1 5 CLCA family members and has been studied in greatest detail for Lu-ECAM-1 . The 
Lu-ECAM-1 open reading frame encodes a precursor glycoprotein of 1 30 kDa that is 
processed to a 90-kDa amino-terminal cleavage product and a group of 30- to 40-kDa 
glycoproteins that are glycosylation variants of a single polypeptide derived from its 
carboxy terminus. Both subunits are associated with the outer cell surface, but only 
20 the 90-kDa subunit is thought to be anchored to.the cell membrane via four 
transmembrane domains. 



Although the protein processing and function appear to be conserved among 
CLCA homologs, significant differences exist in their tissue expression patterns. For 
example, bovine Lu-ECAM-1 is expressed primarily in vascular endothelia, bCLCAl 
is exclusively detected in the trachea, and hCLCAl is selectively expressed in a 
subset of human intestinal epithelial cells. Thus the emerging picture is that of a 
multigene family with members that are highly tissue specific, similar to the CIC 
family of voltage-gated CF channels. The human channel, hCLCA2, is particular 
interesting from a medical and pharmacological standpoint. CLCA2 is expressed on 
the luminal surface of lung vascular endothelia and serves as an adhesion molecule 
for lung metastatic cancer cells, thus mediating vascular arrest and lung colonization. 
Expression of this molecule in normal mammary epithelium is consistently lost in 
human breast cancer and in nearly all tumorigenic breast cancer cell lines. Moreover, 
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re-expression of hCLCA2 in human breast cancer cells abrogates tumorigenicity in 
nude mice, implying that hCLCA2 acts as a tumour suppressor in breast cancer. For 
these reasons, siRNA directed against CLCA family members and related channels 
may prove to be valuable in research and therapeutic venues. 

Transporters Involved in Synaptic Transmission. 

Synaptic transmission involves the release of a neurotransmitter into the 
synaptic cleft, interaction of that transmitter with a postsynaptic receptor, and 
subsequent removal of the transmitter from the cleft, hi most synapses the signal is 
terminated by a rapid reaccumulation of the neurotransmitter into presynaptic 
terminals. This process is catalyzed by specific neurotransmitter transporters lhat are 
often energized by the electrochemical gradient of sodium across the plasma 
membrane of the presynaptic cells. 



1 5 Aminobutyric acid (GABA) is the major inhibitory neurotransmitter in the 

central nervous system. The inhibitory action of GABA, mediated through GABAa/ 
GABAb receptors, and is regulated by GABA transporters (GATs), integral 
membrane proteins located perisynaptically on neurons and glia. So far four different 
carriers (GAT1-GAT4) have been cloned and their cellular distribution has been 

20 partly worked out. Comparative sequence analysis has revealed that GABA 

transporters are related to seyeral other proteins involved in neurotransmitter uptake 
including gamma-aminobutyric acid transporters, monoamine transporters, amino 
add transporters, certain "orphan" transporters, and the recently discovered bacterial 
transporters. Each of these proteins has a similar 12 transmembrane helices topology 

25 and rehes upon the Na+/Cl- gradient for transport fimction. Transport rates are 

dependent on substrate concentrations, with half-maximal effective concentrations for 
transport frequently occurring in the submicromolar to low micromolar range. In 
addition, transporter function is bidirectional, and non-vesicular efHux of transmitter 
may contribute to ambient extracellular transmitter levels. 

30 

Recent evidence suggests that GABA transporters, and neurotransmitter 
transporters in general, are not passive players in regulating neuronal signaling; rather, 
transporter fimction can be altered by a variety of initiating factors and signal 
transduction cascades. In general, this fimctional regulation occurs in two ways, 
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either by changing the rate of transmitter flux through the transporter or by changing 
the number of functional transporters on the plasma membrane. A recurring theme in 
transporter regulation is the rapid redistribution of the transporter protein between 
intracellular locations and the cell surface, ha general, this functional modulation 
) occurs in part through activation of second messengers such as kinases, phosphatases, 
arachidonic acid, and pH. However, the mechanisms underlying transporter 
phosphorylation and transporter redistribution have yet to be fully elucidated. 

GABA transporters play a pathophysiological role in a number of human 
) diseases including temporal lobe epilepsy and are the targets of pharmacological 

interventions. Studies in seizure sensitive animals show some (but not all) of the GAT 
transporters have altered levels of expression at times prior to and post seizure, 
suggesting this class of transporter may affect epileptogenesis, and that alterations 
following seizure may be compensatory responses to modulate seizure activity. For 
> these reasons, siRNAs directed against members of this family of genes (including but 
not limited to SLCG6A1-12) may prove to be valuable research and therapeutic tools. 

Organic Ion Transporters. 

The human body is continuously exposed to a great variety of xenobiotics, via 
I food, drugs, occupation, and environment. Excretory organs such as kidney, hver, and 
intestine defend the body against the potentially harmful effects of these compounds 
by transforming them into less active metabolites that are subsequently secreted from 
the system. 



Camer-mediated transport of xenobiotics and their metabolites exist for the 
active secretion of organic anions and cations. Both systems are characterized by a 
high clearance capacity and tremendous diversity of substances accepted, properties 
that result from the existance of multiple transporters with overlapping substrate 
specificities. The class of organic anion transporters plays a critical role in the 
eUmination of a large number of drugs (e.g., antibiotics, chemotherapeutics, diuretics, 
nonsteroidal anti-inflammatory drugs, radiocontrast agents, cytostatics); drug 
metabolites (especially conjugation products with glutathione, glucuronide, glycine, 
sulfate, acetate); and toxicants and their metaboUtes (e g., mycotoxins, herbicides, 
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plasticizers, glutathione 5-conjugates of polyhaloaIkanes,polyhaloaJkenes, 
hydroquinones, aminophenols), many of which are specifically harmful to the kidney. 

Over the past couple of years the number of identified anion transporting 
5 molecules has grown tremendously. Uptake of organic anions (OA") across the 
basolateral membrane is mediated by the classic sodium-dependent organic anion 
transport system, which includes ot-ketoglutarate (a-KG^-)/OA~ exchange via the 
organic anion transporter (OATl) and sodium-ketoglutarate cotransport via the 
NaVdicarboxylate cotransporter (SDCT2). The organic anion transporting polypetide, 
0 Oatpl , and the kidney-specific OAT-Kl and 0AT-K2 are seen as potential molecules 
that mediate facilitated OA" efflux but could also be involved in reabsorption via an 
exchange mechanism. Lastly the PEPTl and PEPT2 mediate luminal uptake of 
peptide drugs, whereas CNTl and CNT2 are involved in reabsorption of nucleosides 

5 The organic anion-transporting polypeptide 1 (Oatpl) is a Na""- and ATP- 

independent transporter originally cloiied from rat liver. The tissue distribution and 
transport properties of the Oatpl gene product are complex. Oatpl is localized to the 
basolateral membrane of hepatocytes, and is found on the apical membrane of S3 
proximal tubules. Studies with transiently transfected cells (e.g. HeLa cells) have 

) indicated that Oatpl mediates transport of a variety of molepules including 

taurocholate, estrone-3-sulfate, aldosterone, Cortisol, and others. The observed uptake 
of taurocholate by Oatpl expressed in X. laevis oocytes is accompanied by efflux of 
GSH, suggesting that transport by this molecule may be glutathione dependent. 

> Computer modeling suggests that members of the Oatp family are highly 

conserved, hydrophobic, and have 12 transmembrane domains. Decreases in 
expression of Oatp family members have been associated with cholestatic liver 
diseases and human hepatoblastomas, making this family of proteins of key interest to 
researchers and the medical community. For these reasons, siRNAs directed against 

• OAT family members (including but dot limited to SLC21 A2, 3, 6, 8, 9, 1 1, 12, 14, 
1 5, and related transporters) are potentially useful as research and therapeutic tools. 



Nucleoside transporters. 
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Nucleoside transporters play key roles in physiology and pharmacology. 
Uptake of exogenous nucleosides is a critical first step of nucleotide synthesis in 
tissues such as bone marrow and intestinal epithelium and certain parasitic organisms 
that lack de novo pathways for purine biosynthesis. Nucleoside transporters also 
5 control the extracellular concentration of adenosine in the vicinity of its cell surface 
receptors and regulate .processes such as neurotransmission and cardiovascular 
activity. Adenosine itself is used clinically to treat cardiac arrhythmias, and nucleoside 
transport inhibitors such as dipyridamole, dilazep, and draflazine function as coronary 
vasodilators. 

10 

In mammals, plasma membrane transport of nucleosides is brought about by 
members of the concentrative, Na^-dependent (CNT) and equilibrative, Na*- 
independent (ENT) nucleoside transporter families. CNTs are expressed in a tissue- 
specific fashion; ENTs are present inmost, possibly all, cell types and are responsible 

15 for the movement of hydrophilic nucleosides and nucleoside analogs down their 
concentration gradients. In addition, structure/fiinction studies of ENT family 
members have predicted these molecules to contain eleven transmembrane helical 
segments with an amino terminus that is intracellular and a carboxyl terminus that is 
extracellular. The proteins have a large glycosylated loop between TMs 1 and 2 and 

20 a large cytoplasmic loop between TMs 6 and 7. Recent investigations have implicated 
the TM 3-6 region as playing a central role in solute recognition. The medical 
importance of the ENT family of proteins is broad, hi humans adenosine exerts a 
range of.cardicp»tective effects and inhibitors of ENTs are seen as being valuable in 
alleviating a variety of cardio/cardiovascular ailments. In addition, responses to 

25 nucleoside analog dmgs has been observed to vary considerably amongst e.g. cancer 
patients. While some forms of drug resistance have been shown to be tied to the up- 
regulation of ABC-transporters (e.g. MDRl), resistance may also be the result of 
reduced drug uptake {i.e. reduced ENT expression). Thus, a clearer understanding of 
ENT transporters may aid in optimizing drug treatments for patients suffering a wide 

30 range of malignancies. For these reasons, siRNAs directed against this class of 

molecules (including SLC28A1-3, SLC29A1-4, and related molecules) may be usefial 
as therapeutic and research tools. 

Sulfate Transporters.' 
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All cells require inorganic sulfate for normal function. Sulfate is the fourth 
most abundant anion in human plasma and is the major source of sulfur in many 
organisms. Sulfation of extracellular matrix proteins is critical for maintaining 
normal cartilage metabolism and sulfate is an important constituent of myelin 
5 membranes found in the brain 



Because sulfate is a hydrophilic anion that cannot passively cross the lipid 
bilayer of cell membranes, all cells require a mechanism for sulfate influx and efflux 
to ensure an optimal supply. To date, a variety of sulfate transporters have been 

1 0 identified in tissues from many origins. These include the renal sulfate transporters 
(NaSi-1 and Sat-1), the ubiquitously expressed diastrophic dysplasia sulfate 
transporter (DTDST), the intestinal sulfate transporter (DRA), and the erythrocyte 
anion exchanger (AEl). Most, if not all, of these molecules contain the classic 12 
transmembrane spanning domain architecture commonly found amongst members of 

15 the anion transporter superfanuly. 



Recently three different sulfete transporters have been associated with specific 
human genetic diseases. Family members SLC26A2, SLC26A3, and SLC26A4 have 
been recognized as flie disease genes mutated in diastrophic dysplasia, congenital 

20 chloride diarrhea (CLD), and Pendred syndrome (PDS), respectively^ DTDST is a 

particularly complex disorder. The gene encoding this molecule maps to chromosome 
5q, and encodes two distinct transcripts due to alternative exon usage, to contrast to 
other sulfate traHs^pEters {e.g. Sat-1) anion movement by the DTDST protein is 
markedly inhibited by either extracellular chloride or bicarbonate. Impaired function 

25 of the DTDST gene product leads to undersulfation of proteoglycans and a complex 
family of recessively inherited osteochondrodysplasias (achondrogenesis type IB, 
atelosteogenesis type II, and diastrophic dysplasia) with clinical features including but 
not limited to, dwarfism, spinal deformation, and specific joint abnormalities. 
Interestingly, while epidemiological shidies have shown that tiie disease occurs in 

30 most populations, it is particularly prevalent in Finland owing to an apparent founder 
effect. For these reasons, siRNAs directed against this class of genes (including but 
not limited to SLC26A1 -9, and related molecules) may be potentially helpful in both 
therapeutic and research venues. 
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Ion Exchangers 

Intracellular pH regulatory mechanisms are critical for the maintenance of 
countless cellular processes. For instance, in muscle cells, contractile processes and 
metaboHc reactions are influenced by pH. Duringperiods of increased energy 
5 demands and ischemia, muscle cells produce large amounts of lactic acid that, without 
quick and efficient disposal, would lead to acidification of the sarcoplasm. 

Several different transport mechanisms have evolved to maintain a relatively 
constant intracellularpH. The relative contribution of each of these processes varies 

) with cell type, the metabolic requirements of the cell, and the local environmental 
conditions. Litracellular pH regulatory processes that have been characterized 
functionally include but are not limited to the Na'^/HT' exchange, the Na(HC03)„ 
cotransport, and the Na'^-dependent and -independent C^^ase exchangers. As 
bicarbonate and CO2 comprise the major pH buffer of biological fluids, sodium 

J biocarbonate cotransporters (NBCs) are critical. Studies have shown that these 
molecules exist in numerous tissues including the kidney, brain, liver, cornea, heart, 
and lung, suggesting that NBCs play an important role in mediating HCO3- transport 
in both epithelial as well asnonepithelial cells. 



Recent molecular cloning experiments have identified the existence of four 
NBC isoforms (NBCl. 2, 3 and 4) and two NBC-related proteins, AE4 and NCBE 
(Anion Exchanger 4 and Na-dependent Chloride-Bicarbonate Exchanger). The 
secondary structuE&aaal^ffies and hydropathy profile of this family predict them to be 
intrinsic membrane proteins with 12 putative transmembrane domains and several 
family members exhibit ^■-linked glycosylation sites, protein kinases A and C, casein 
kinase II, and ATP/GTP-binding consensus phosphorylation sites, as well as potential 
sites for myristylation and amidation. AE4 is a relatively recent addition to this 
family of proteins and shows between 30-48% homology with the other family 
members. When expressed in COS-7 cells andXenopus oocytes AE4 exhibits sodium- 
independent and DIDS-insensitive anion exchanger activity. Exchangers have been 
shown to be responsible for a variety of human diseases. For instance, mutations in 
three genes of the anion transporter family (SLC) are believed to cause known 
hereditary diseases, including chondrodysplasia (SLC26A2, DTD), diarrhea (A3, 
down-regulated in adenoma/chloride-losing diarrhea protein: DRA/CLD), and 
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goiter/deafiiess syndrome (A4, pendrin). Moreover, mutations in Na+/HC03 co- 
transporters have also been associated wifli various human maladies. For these 
reasons, siRNAs directed against these sorts of genes {e.g. SLC4A4-10, and related 
genes) may be useful for therapeutic and research purposes. 

Receptors Involved in Synaptic Transmission 

In all vertebrates, fast inhibitory synaptic transmission is the result of the 
interaction between the neurotransmitters glycine (Gly) and y-aminobutyric acid 
(GABA) and their respective receptors. The strychnine-sensitive glycine receptor is 
especially important in that it acts in the mammalian spinal cord and brain stem and 
has a well-established role in the regulation of locomotor behavior. 

Glycine receptors display significant sequence homology to several other 
receptors including the nicotinic acetylcholine receptor, the aminobutyric acid 
receptor type A (GABAaR), and the serotonin receptor type 3 (5-HT3R) subunits. As 
members of the superfamily of ligand-gated ion channels, these polypeptides share 
common topological features. The glycine receptor is composed of two types of 
glycosylated integral membrane proteins (al-a4 and p) arranged in apentameric 
suprastructure. The alpha subunit encodes a large extracellular, N-terminal domain 
that carries the structural determinants essential for agonist and antagonist'Binding, 
followed by four transmembrane spanning regions (TM1-TM4), with TM2 playing the 
critical role of forming Ifae inner wall of the chloride channel. 

The density, location, and subunit composition of glycine neurotransmitter 
receptors changes over the course of development. It has been observed that the 
amount of GlyR gene translation (assessed by the injection of developing rat cerebral 
cortex mRNA into Xenopus oocytes) decreases with age, whereas that of GABARs 
increases, hi addition, the type and location of mRNAs coding for GlyR changes over 
the course of development. For instance m a study of the expression of alpha 1 and 
alpha 2 subunits in the rat, it was observed that (in embryonic periods El 1 -1 8) the 
mantle zone was scarce in the alpha 1 mKNA, but the germinal zone (matrix layer) at 
El 1-14 expressed higher levels of the message. At postnatal day 0 (PO), the alpha 1 
signals became manifested throughout the gray matter of the spinal cord. By contrast, 
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the spinal tissues at PO exhibited the highest levels of alpha 2 mRNA, which 
decreased with the postnatal development. 

In both, man and mouse mutant lines, mutations of GlyR subunit genes result 
5 in hereditary motor disorders characterized by exaggerated startle responses and 
increased muscle tone. Padiological alleles of the Glral gene are associated with the 
murine phenotypes oscillator (spd°^) and spasmodic (spd). Sunilarly, a mutant allele 
of Glrb has been found to underly the molecular pathology of the spastic mouse (spa). 
Resembling the situation in the mouse, a variety of GLRAl mutant alleles have been 
1 0 shown to be associated with the human neurological disorder hyperelqplexia or startle 
disease. For these reasons, siRNA directed against glycine receptors (GLRAl -3, 
GLRB, and related molecules), glutamate receptors, GABA receptors, ATP receptors, 
and related neurotransmitter receptor molecules may be valuable therapeutic and 
research reagents. 

15 

Proteases 

KaUikreins 

One important class of proteases are the kallikreins, serine endopeptidases that 
20 split peptide substrates preferentially on the C-terminal side of internal arginyl and 
lysyl residues. Kallikreins are generally divided into two distinct groups, plasma 
kallikreins and tissue kallikreins. Tissue kallikreins represent a large group of 
enzymes that have substantial similarities at both the gene and protein level. The 
genes encoding this group are frequently found on a single chromosome, are 
25 organized in clusters, and are expressed in a broad range of tissues {e.g. pancreas, 
ovaries, breast). In contrast, the plasma form of the enzyme is encoded by a single 
gene {e.g. KLK3) that has been localized to chromosome 4q34-35 in humans. The 
gene encoding plasma kallikrein is expressed solely in the liver, contains 15 exons, 
and encodes a glycoprotein that is translated as a preprotein called prekallikrein. 

30 

Kallikreins are believed to play an important role in a host of physiological 
events For instance, the immediate consequence of plasma prekallikrein activation is 
the cleavage of high molecular weight kininogen (HK) and the subsequent liberation 
of bradykinin, a nine amino acid vasoactive peptide that is an important mediator of 
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inflammatory responses. Similarly, plasma kallikxein promotes single-chain urokinase 
activation and subsequent plasminogen activation, events that are critical to blood 
coaggulation and wound healing. 



Disruptions in the ftinction of kallikreins have been implicated in a variety of 
pathological processes including imbalances in renal function and inflammatory 
processes. For these reasons, siRNAs directed against this class of genes (e.g. KLKl- 
1 5) may prove valuable in both research and therapeutic settings. 

ADAM Proteins 

The process of fertilization takes place in a series of discrete steps whereby the 
sperm interacts with;^i) the cumulus cells and the hyaluronic acid extracellular matrix 
(ECM) in which they are embedded, ii) the egg's own ECM, called the zona pellucida 
(ZP), and iii) the egg plasma membrane. During tiie course of these interactions, tiie 
"acrosome reaction," the exocytosis of the acrosome vesicle on the head of the sperm, 
is induced, allowing flie sperm to penetrate the ZP and gain access to the perivitelline 
space. This process exposes new portions of the sperm membrane, including the inner 
acrosomal membrane and the equatorial segment, regions of the spenn head that can 
participate in initial gamete membrane binding. 

The interactions of the gamete plasma membranes appear to involve multiple 
ligands and receptors and are frequenfly compared to leukocyte-endothelial 
interactions. These interactions-lead-to a series of signal transduction events in the 
egg, known as collectively as egg, activation and include the initiation of oscillations 
in intracellular calcium concentration, the exit from meiosis, the entry into the first 
embryonic mitosis, and the formation of a block to polyspermy via the release of ZP- 
modifying enzymes from the egg's cortical granules. Ultimately, sperm and egg not 
only adhere to each other but also go on to undergo membrane fusion, making one 
cell (the zygote) from two. 

Studies on the process of sperm-egg interactions have identified a number of 
proteins that are crucial for fertilization. One class of proteins, called the ADAM 
family (A Disintegrin And Metalloprotease), has been found to be important in 
spermatogenesis and fertilization, as well as various developmental systems including 
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myogenesis and neurogenesis. Members of the family contain a disintegrin and 
metalloprotease domain (and therefore have (potentially) both cell adhesion and 
protease activities), as well as cysteine-rich regions, epidermal growth factor (EGF)- 
like domains, a transmembrane region, and a cytoplasmic tail. Currently, the ADAM 
5 gene family has 29 members and constituents are widely distributed in many tissues 
including the brain, testis, epididymis, ovary, breast, placenta, liver, heart, lung, bone, 
and muscle. 



One of the best-studied members of the ADAM family is fertilin, a 
1 0 heterodimeric protein comprised of at least two subunits, fertilin alpha and fertilin 
beta. The fiertUin beta gene (ADAM2) has been disrupted with a targeting gene 
construct corresponding-to the exon encoding the fertilin beta, disintegrin domain. 
Sperm from males homozygous for disruptions in this region exhibit defects in 
multiple facets of sperm ftmction including reduced levels of sl^,erm transit from the 
1 5 uterus to the oviduct, reduced sperm-ZP binding, and reduced s^erm-egg binding, all 
of which contribute to male infertility. 

Recently, four new ADAM family members (ADAM 24-27) have bken 
isolated. The deduced amino acid sequences show that all four contain the complete 
20 domain organization common to ADAM family members and Northem Blot analysis ' 
has shown aU four to be specific to the testes. SiRNAs directed against this class of 
genes {e.g. ADAM2 and related proteins) may be usefid as research tools and 
therapeutics directed toward fertilitysaad>birth control. 

25 Aminopeptidases 

Aiiiinopeptidases are proteases that play critical roles in processes such as 
protein maturation, protein digestion in its terminal stage, regulation of hormone 
levels, selective or homeostatic protein turnover, and plasmid stabilization. These 
enzymes generally have broad substrate specificity, occur in several forms and play a 

30 major role in physiological homeostasis. For instance, the effects of bradykinin, 

angiotensin converting enzyme (ACE), and other vasoactive molecules are muted by 
one of several peptidases that cleave the molecule at an intemal position and eliminate 
its ability to bind its cognate receptor {e.g. for bradykinin, the B2-receptor). 
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Among the enzymes that can cleave bradykinin is the membrane boimd 
aminopeptidase P, also referred to as aminoacylproline aminopeptidase, proline 
aminopeptidase; X-Pro aminopeptidase (eukaryote) and XPNPEP2. Aminopeptidase 
P is an aminoacylproline aminopeptidase specific for NH2-tenninal Xaa-proline 
5 bonds. The enzyme i) is a mono-zinc-containing molecule that lacks any of the typical 
metal binding motifs found in other zinc metalloproteases, ii) has an active-site 
configuration similar to that of other members of the MG peptidase family, and iii) is 
present in a variety of tissues including but not limited to the lung, kidney, brain, and 
intestine. 

10 

Aminopeptidases play an important role in a diverse set of human diseases. 
Low plasma concentrations-of aminopeptidase P are a potential predisposing factor 
for development of angio-oedema in patients treated with ACE inhibitors, and 
inhibitors of aminopeptidase P may act as cardioprotectors against other forms of 
1 5 illness including, but not limited to myocardial infarction. For these reasons, siKNAs 
directed against this family of proteins (including but not limited to XPNPEPl and 
related proteins) may be useful as research and therapeutic tools. 

Serine Proteases 

20 ' One important class of proteases are the serine proteases. Serine proteases 

share a common catalytic triad of three amino acids in their active site (serine 
(nucleophile), aspartate (electrophile), and histidine (base)) and can hydrolyze either 
esters or peptide bonds utilizing mechamsms-of covalent catalysis and preferential 
binding of the transition state. Based on the position of their introns serine proteases 

25 have been classified into a minimum of four groups including those in which 1) the 
gene has no introns interrupting the exon coding for the catalytic triad {e.g. the 
haptoglobin gene,); 2) each gene contains an intron just downstream fi-om the codon 
for the histidine residue at the active site, a second intron downsh-eam fi-om the exon 
containing the aspartic acid residue of the active site and a third intron just upstream 

30 firom the exon containing the serine of the active site {e.g. trypsinogen, 

chymotrypsinogen, kallikrein and proelastase); 3) the genes contain seven introns 
interrupting the exons coding the catalytic region {e.g. complement factor B gene); 
and 4) the genes contain two introns resulting in a large exon that contains both the 
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active site aspartatic acid and serine residues (e.g. factor X, factor IX and protein C 



Cytotoxic lymphocytes (e.g. CD8(+) cytotoxic T cells and natural killer cells) 
form the major defense of higher organisms against virus-infected and transformed 
cells. A key function of these cells is to detect and eliminate potentially harmful cells 
by inducing them to undergo apoptosis. This is achieved through two principal 
pathways, both of which require direct but transient contact between the killer cell and 
its target. The first pathway involves ligation of TNF receptor-like molecules such as 
Fas/CD95 to their cognate ligands, and results in mobilization of conventional, 
programmed cell-death pathways centered on activation of pro-apoptotic caspases. 
The second mechanism consists'of a pathway whereby the toxic contents of a 
specialized class of secretory vesicles are introduced into the target cell. Studies over 
the last two decades have identified the toxic components as Granzymes, a family of 
serine proteases that are expressed exclusively by cytotoxic T lymphocytes and 
natural killer (NK) cells. These agents are stored in specialized lytic granules and 
enter the target cell via endocytosis. Like caspases, cysteine proteases that play an 
important role in apoptosis, granzymes can cleave proteins after acidic residues, 
especiaUy aspartic acid, and induce apoptosis in the recipient cell. 

Granzymes have been grouped into three subfamilies according to substrate 
specificity. Members of the granzyme family that have enzymatic activity similar to 
the serine protease chymotrypsin are encodedrby-a-gene cluster termed the 'chymase 
locus'. Similarly, granzymes with trypsin-like specificities are encoded by the 
'tryptase locus', and a third subfamily cleaves after unbranched hydrophobic residues, 
especially methionine, and are encoded by the 'Met-ase locus'. All granzymes are 
synthesized as zymogens and, after clipping of the leader peptide, obtain maximal 
enzymatic activity subsequent to the removal of an amino-terminal dipeptide. 

Granzymes have been found to be important in a number of important 
biological functions including defense against intracellular pathogens, graft vereus 
host reactions, the susceptibility to transplantable and spontaneous malignancies, 
lymphoid homeostasis, and the tendency toward auto-immune diseases. For these 
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reasons, siRNAs directed against granszymes (e.g. GZMA, GZMB, GZMH, GZHK, 
GZMM) and related serine proteases may be useful research and therapeutic reagents. 



Protein Kinases (PKs) have been implicated in a number of biological 
processes. Kinase molecules play a central role in modulating ceUular physiology and 
developmental decisions, and have been implicated in a large list of human maladies 
including cancer, diabetes, and others. 

During the course of the last three decades, over a hundred distinct protein 
kinases have been identified, aU with presumed specific cellular fimctions. A few of 
these enzymes have been isolated to sufficient purity to perfonn in vitro studies, but 
most remain intractable due to the low abundance of these molecules in the cell. To 
counter this technical difficulty, a number of protein kinases have been isolated by 
molecular cloning strategies that utilize the conserved sequences of the catalytic 
domain to isolate closely related homologs. Alternatively, some kinases have been 
purified (and subsequently studied) based on their interactions with other molecules. 

p58 is a member of the p34cdc2-related supergene family and contains a large 
domain that is highly homologous to the cell division control kinase, cdc2. This new 
cell division control-related protem kinase was originally identified as a component of 
semipuiified galactosyltransferase; thus, it has been denoted galactosyltransferase- 
assodated protein kinase (GTA-kinase). GTA=fcinaseJias been found to be expressed 
in both adult and embryonic tissues and is known to phosphorylate a number of 
substrates, including histone HI , and casein. Interestingly enough, over expression of 
this molecule in CHO cells has shown that elevated levels of p58 result in a prolonged 
late telophase and an early Gl phase, thus hinting of an important role for GTA- 
kinase in cell cycle regulation. 

Cvclin Dep endent Kinasas 

The cyclin-dependent kinases (Cdks) are a family of highly conserved 
serine/threonine kinases that mediate many of the cell cycle transitions that occur 
during dupUcation. Each of these Cdk catalytic subunits associates with a specific 
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subset of regulatory subunits, termed cyclins, to produce a distinct Cdk cyclin kinase 
complex that, in general, functions to execute a unique cell cycle event. 



Activation of the Cdk cyclin kinases during cellular transitions is controlled by 
5 a variety of regulatory mechanisms. For the Cdc2-cyclin B complex, inhibition of 
kinase activity during S phase and G2 is accomplished by phosphorylation of two 
Cdc2 residues, Thr"* and Tyr'^ which are positioned within the ATP-binding cleft. 
Phosphorylation of Thr'" and/or Tyr'^ suppresses the catalytic activity of the molecule 
by disrupting the orientation of the ATP present within this cleft. In contrast, the 

1 0 abrupt dephosphorylation of these residues by the Cdc25 phosphatase results in the 
rapid activation of Cdc2 cyclin B kinase activity and subsequent downstream mitotic 
events. While the exact details of this pathway have yet to be elucidated, it has been 
proposed that Thr'^A-yr'^ phosphorylation functions to permit a cell to attain a critical 
concentration of inactive Cdk cyclin complexes, which, upon activation, induces a 

15 rapid and complete cell cycle transition. Fur1hennore,fliere is evidence in 
mammalian cells that Thr"/Tyr'^ phosphorylation also fimctions to delay Cdk 
activation after DNA damage. 

The Schizosaccharomyces pombe weel gene product was the first kinase 
20 identified that is capable of phosphorylating Tyr'^ in Cdc2. Homologs of the Wecl 
kinase have been subsequently identified and biochemically characterized from a wide 
range of species including human, mouse, fi-og, Saccharomyces cerevisiae, and 
Drosophila. In vertebrate systems, where Thr'^in^GdeS^is also phosphorylated, the 
Weel kinase was capable of phosphorylating Cdc2 on Tyr'^ but not Thr'^ indicating 
25 that another kinase was responsible for Thr'" phosphorylation. This gene, Mytl 

kinase, was recently isolated fi-om the membrane fi-actions of Xenopus egg extracts 
and has been shown to be capable of phosphorylating Thr"* and, to a lessor extent, 
Tyr'^ in Cdc2. A human Mytl homolog displaying similar properties has been 
isolated, as well as a non-membrane-assodated molecule with Thr^* kinase activity. 

30 

In the past decade it has been shown that cancer can originate fi-om 
overexpression of positive regulators, such as cyclins, or firom underexpression of 
negative regulators (e.g. pl6 (INK4a), pl5 (INK4b), p21 (Cipl)). Inhibitors such as 
Mytl are the focus of much cancer research because they are capable of controlling 
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cell cycle proliferation, now considered the Holy Grail for cancer treatment. For these 
reasons, siRNA directed against kinases and kinase inhibitors including but not 
limited to ABLl, ABL2, ACKl, ALK, AXL, BLK, BMX, BTK, C20orf64, CSFIR, 
SCK, DDRl, DDR2, DKFZp761P1010, EGFR, EPHAl, EPHA2, EPHA3, EPHA4, 

5 EPHA7, EPHA8, EPHBl, EPHB2, EPHB3, EPHB4. EPHB6, ERBB2, ERBB3, 
ERBB4, PER, FES, FGFRl, FGFR2, FGFR3, FGFR4, FGR, FLTl, FLT3, FLT4, 
FRK, FYN, HCK, IGFIR, INSR, ITK, JAKl, JAK2, JAK3, KDR, KIAA1079, KIT, 
LCK, LTK, LYN, MATK, MERTK, MET, MSTIR, MUSK, NTRKl, NTRK2, 
NTRK3, PDGFRA, PDGFRB, PTK2, PTK2B, PTK6, PTK7, PTK9, PTK9L, RET, 

1 0 RORl , ROR2, ROS 1 , RYK, SRC, SYK, TEC, TEK, TIE, TNKl , TXK, TYK2, 
TYR03, YESl , and related proteins, may be useful for research and therapeutic 
purposes. 

G Protein Coupled Receptors 

15 

One important class of genes to which siRNAs can be directed are G-protein 
coupled receptors (GPCRs). GPCRs constitute a superfamily of seven transmembrane 
spanning proteins that respond to a diverse array of sensory and chemical stimuli, 
such as light, odor, taste, pheromones, hormones and neurotransmitters. GPCRs play a 
20 central role in cell proliferation, differentiation, and have been implicated in the 
etiology of ffisease. 

The mechanism by which G protein-coupled receptors translate extracellular 
signals into cellular changes was initially envisioneB~ as a sunple linear model: 

25 activation of the receptor by agonist binding leads to dissociation of the heterotrimeric 
GTP-binding G protein (Gs, Gi, or Gq) into its alpha and beta/gamma subunits, both 
of which can activate or inhibit various downstream effector molecules. More 
specifically, activation of the GPCR induces a conformational change in the Ga 
subtmit, causing GDP to be released and GTP to be bound in its place. The Ga and 

30 Gp7 subunits then dissociate bom the receptor and interact with a variety of effector 
molecules. For instance in the case of the Gs family, the primary fimction is to 
stimulate the intracellular messenger adenylate cyclase (AC), which catalyzes the 
conversion of cytoplasmic ATP into the secondary messenger cyclic AMP (cAMP). 
In contrast, the Gi family inhibits this pathway and the Gq family activates 
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phospholipases C (PLC), which cleaves phosphatidylinositol 4,5, bisphosphate (PIP2) 
to generate inositol-l,4,5-phosphate (IP3) and diacyl glycerol (DAG). 



More recently, studies have shown that the functions of GPCRs are not limited 
5 to their actions on G-proteins and that considerable cross-talk exists between this 
diverse group of receptor molecules and a second class of membrane bound protems, 
the receptor tyrosine kinases (RTKs). A number of GPCRs such as endothelin-1, 
thrombin, bombesin, and dopamine receptors can activate MAPKs, a downstream 
effector of the RTK/Ras pathway. Interestingly, the interaction between these two 
1 0 famiUes is not unidirectional and RTKs can also modulate the activity of signaling 
pathways traditionally thought to be controUed exclusively by ligands that couple to 
GPCRs. For instance, EGF, which nomally activates the MAPK cascade via the 
EGF receptor can stimulate adenylate cyclase activity by activating Gas. 

1 5 There are dozens of members of the G Protein-Cov5>led Receptor family that 

have emerged as prominent drag targets in the last decade. One non-limiting list of 
potential GPCR-siRNA targets is as follows: 

CMKLRl 

20 CMLl/ GMKLRl (Accession No. Q99788) is a member of the chemokine 

receptor family of GPCRs that may play a role in a number of diseases including 
those involved in inflammation and immunological responses {e.g. asthma, arthritis). 
For this reason, siRNA dnected against this protein may^ro^ie40-be important 
therapeutic reagents. 

25 

Studies of juvenile-onset neuronal ceroid lipofuscinosis (JNCL, Batten 
disease), the most common form of childhood encephalopathy that is characterized by 
progressive neural degeneration, show that it is brought on by mutations in a novel 
lysosomal membrane protein (CLN3). In addition to being imphcated in JNCL, 
30 CLN3 (GPCR-like protein, Accession No. A5721 9) expression shidies have shown 
that the CLN3 mRNA and protein are highly over-expressed in a number of cancers 
(e.g. glioblastomas, neuroblastomas, as well as cancers of the prostate, ovaries, breast, 
and colon) suggesting a possible contribution of this gene to tumor growth. For this 



wo 2004/045543 PCT/IJS2003/036787 
100 

reason, siRNA directed against this protein may prove to be important therapeutic 



CLACR 

5 The calcitonin receptor (CTR/ CALCR, Accession No. NM_001742) belongs 

to "family B» of GPCRs which typically recognized regulatory peptides such as 
parathyroid honnone, secretin, glucagons and vasoactive intestinal polypeptide. 
Although the CT receptor typically binds to calcitonin (CT), a 32 amino acid peptide 
hormone produced primarily by the thyroid, association of the receptor with RAMP 
10 (Receptor Activity Modulating Protein) enables it to readily bind other members of 
the calcitonin peptide family including amylin (AMY) and other CT gene-related 
peptides {e.g. aCGRP and PCGRP). While the primary function of the calcitonin 
receptor pertains to regulating osteoclast mediated bone resorption and enhanced Ca^^ 
excretion by the kidney, recent studies have shown that CT and CTRs may play an 

15 important role in a variety of processes as wide ranging as embryonic/foetal 

development and sperm fimction/physiology. In addition, shidies have shown that 
patients with particular CTR genotypes may be at higher risk to lose bone mass and 
that this GPCR may contribute to the formation of calcium oxalate urinary stones. For 
this reason, siRNA directed against CTR may be useful as therapeutic reagents 

20 -v 
OXTR 

The human oxytocin receptor (CTR, OXTR) is a 389 amino acid polypeptide 
that exhibits the seven transmembrane domain structure and belongs to the Class-I 
(rhodopsin-type) family of G-protein coupled receptors. OTR is expressed in a wide 

25 variety of tissues throughout development and mediates physiological changes 

through G(q) proteins and phospholipase C-beta. Studies on the functions of oxytocin 
and the oxytocin receptor have revealed a broad list of duties. OT and OTR play a 
role in a host of sexual, maternal and social behaviors that include egg-laying, birth, 
milk-letdown, feeding, grooming, memory and learning, hi addition, it has blen 

30 hypothesized that abnormalities in the functionality of oxytocin-OTR receptor-ligand 
system can lead to a host of irregularities including compulsive behavior, eating 
disorders (such as anorexia), depression, and various forms of neurodegenerative 
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diseases. For these reasons, siRNA directed against this gene (NM_000916) may play 
an important role in combating OTR-assodated illnesses. 



EDG GPCRs 

5 Lysophosphatidic acid and other lipid-based hormones/growth factors induce 

their effects by activating signaling pathways through the G-protein coupled receptors 
(GPCRs) and have been observed to play important roles in a number of human 
diseases including cancer, asthma, and vascular pathologies. For instance, during 
studies of immunoglobulin A nephropathy (IgAN), researchers have observed an 
10 enhanced expression of EDG5 (NP_004221) suggesting a contribution of this gene 
product in the development of IgAN. For that reasons, siRNA directed against Edg5 
(NM_004230), Edg4 (NM_004720), Edg7 (Nm_012152) and related genes may play 
an important role in combating human disease. 



15 Genes Involved in Cholesterol Signaling and Biosvnthesis 

Studies on model genetic organisms such as Drosophila and C. elegans have 
led to the identification of a plethora of genes that are essential for early development. 
Mutational analysis and ectopic expression studies have allowed many of these genes 
to be grouped into discreet signal transduction pathways and have shown that these 

20 elements play critical roles in pattern formation and cell differentiation. Disruption of 
one or more of these genes during early stages of development frequently leads to 
birth defects whereas as .alteration of gene function at later stages in Hfe can result in 
tumorigenesis. _ . ^, 

25 One critical set of interactions known to exist in both invertebrates and 

vertebrates is the Sonic Hedgehog-Patched-Gli pathway. Originally documented as a 
Drosophila segmentation mutant, several labs have recently identified human and 
mouse orthologs of many of the pathways members and have successfully related 
disruptions in these genes to known diseases. Pathway activation is initiated with the 

30 secretion of Sonic hedgehog. There are three closely related members of the Shh 
family (Sonic hedgehog. Desert, and Indian) with Shh being the most widely 
expressed form of the group. The Shh gene product is secreted as a small pro-signal 
molecule. To successfully initiate its developmental role, Shh is first cleaved, 
whereupon tiie N-terminal truncated firagment is covalaitly modified with cholesterol. 
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The addition of the sterol moiety promotes the interaction between Shh and its 
cognate membrane bound receptor, Patched (Ptch). There are at least two isoforms of 
the Patched gene, Ptchl andPtch2. Both isoforms contain a sterol-sensing domain 
(SSD); a roughly 180 amino acid cluster that is found m at least seven different 
classes of molecules including those involved in cholesterol biosynthesis, vesicular 
trafBc, signal transduction, cholesterol Ixansport, and sterol homeostasis. In the 

absenceofShh,thePatched protein isanegativeregulatorofthepathway.ln contrast 
bmding of Shh-cholesterol to the Patched receptor releases the negative inhibition 
which that molecule enforces on a G-protein coupled receptor known as Smoothened 
Subsequent activation of Smoothened (directly or indirectly) leads to the triggering of 
a tno of transcription factors that belong to the Gli family. All three factors are 
relatively large, contain a characteristic C2-H2 zinc-fingerpfentamer, and recognize 
one of two consensus sequences (SEQ. ID NO. 0463 GACCACCCA or SEQ ID NO 
0464GAACCACCCA). In the absence of Shh. Gli proteins are cleaved by the 
proteosome and the C-terminally truncated ftagment translocates to the nucleus and 
actsasadominanttranscriptionrepressor. In the pr^ence of Shh-cholesterol Gli 
repressor fonnation is inhibited and full-length Gli fonctions as a transcriptional 
activator. 



Shh and other members of the Shh-PTCH-Gli pathway are expressed in a 
broad range of tissues ie.g. the notochord, the floorplate of the neural tube, the brain 
and the gut) at early stages in development. Not surprisingly, mutations that lead to 
altered protein expression or function have been shown to induce developmental 
abnormalities. Defects in the human Shh gene have been shown to cause 
holoprosencephaly. amidline defect that manifests itself as cleft lip or palate CNS 
septation. and a wide range of other phenotypes. Interestingly, defects in cholesterol 
biosynthesis generate similar Shh-like disorders ie.g. Smith-Lemli-Opitz syndrome) 
suggestmg that cholesterol modification of the Shh gene product is crucial for 
pathway function. Both the Patched and Smoothened genes have also been shown to 

be clmicallyrelevant with Smoothened now being recognized as an oncogene that 
like PTCH-1 and PTCH-2, is believed to be the causative agent of several forms of 
adult tumors. For these reasons, siRNA directed against Smoodiened (SMO, 
NM_00563 1), Patched (PTCH. mn_000264), and additional genes that participate in 

•5? 
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cholesterol signaling, biosynthesis, and degradation, have potentially usefiil research 
and therapeutic apphcations. 

Targeted Pathways . 

5 In addition to targeting siRNA against one or more members of a family of 

proteins, siRNA can be directed against members of a pathway. Thus, for instance, 
siRNA can be directed against members of a signal transduction pathway {e.g. the 
insulin pathway, including AKTl-3, CBL, CBLB, EIF4EBP1, FOXOl A, F0X03A, 
FRAPl, GSK3A, GSK3B, IGFl, IGFIR, INPP5J), INSR, IRSl, MLLT7, PDPKl, 
10 PIK3CA, PIK3CB, PIK3R1, PIK3R2, PPP2R2B, PTEN, RPS6, RPS6KA1, 

RPX6KA3, SGK, TSCl, TSC2, AND XPOl), an apoptotic pathway (CASP3, 6,7,8,9, 
DSHl/2, PI 10, P85, PDKl/2, CATENIN, HSP90, CDC37, P23, BAD, BCLXL, 
BCL2, SMAC, and others), pathways, involved in DNA damage, cell cycle, and other 
physiological (p53,MDM2, CHKl/2, BRCAl/2, ATM, ATR, P151NK4, P27, P21, 
1 5 SKP2, CDC25C/A, 14-3-3, PLK, RB, CDK4, GLUT4, Inos, Mtor, FKBP, PPAR, 
RXR, ER). Sunilarly, genes involved in immune system fimction including TNFRl, 
IL-IR, IRAKl/2, TRAF2, TRAF6, TRADD, FADD, IKKs, KKy. KKp, IKKa, 
IkBa, ncBP, p50, p65, Rac, RhoA, Cdc42, ROCK, Pakl/2/3/4/5/6, cIAP, HDACl/2, 
CBP, p-TrCP, Rip2/4, and others are also important targets for the siRNAs described 
20 in this document and may be useiul in treating immune system disorders. Genes 

involved in apoptosis, such as Dshl/2,PTEN, PllO (pan), P85, PDKl/2, Aktl, Akt2, 
Akt (pan), p70^*^, GSK3P, PP2A (cat), p-catenin, HSP90, Cdc37/p50, P23, Bad, 
BclxL, Bcl2, Smac/Diablo, and Askl are potentially useful in the treafBrefit-"bf 
diseases that iovolve defects in programmed cell death (e.g. cancer), while siRNA 
25 agents directed against p53, MDM2, Chkl/2, BRCAl/2, ATM, ATR, pl5^^\ Y21, 
P21, Skp2, Cdc25C/A, 14-3-3a/8, PLK, Rb, Cdk4, Glut4, iNOS, mTOR, FKBP, 
PPARy, RXRa,.ERa and related genes may play a critical role in combating diseases 
associated with disruptions in DNA repair, and cell cycle abnormalities. 

30 Tables VI -Table X below provide examples of useful pools for inhibiting 

different genes in the human insulin pathway and tyrosine kinase pathways, proteins 
involved in the cell cycle, the production of nuclear receptors, and other genes. These 
particular pools are particularly useful in humans, but would be usefiil in any species 
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that generates an appropriately homologous mRNA. Further, within each of the listed 
pools any one sequence maybe used independently but preferably at least two of the 
hsted sequences, more preferably at least three, and most preferably all of the listed 
sequences for a given gene is present. 
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GSK3A 








-003009-05 G 


GACAAAGGTGl iCAAATG 




GSK3A N 
GSK3A ^ 


M 019884 1 
}J\ 019884 1 


1995473 2 
1995473 2 


331 D 
931 D 


-003009-06 G 


AACGCAGCTGCCTAACAA 


502 


GSK3A N 
GSK3A N 


M 019884 1 
M 019884 1 


1995473 2 
1995473 2 


931 D 
931 C 


-003009-07 C 
1-003009-08 C 


CGCACAGCTTCTTTGATG 
CTCTAGCCTGCTGGAGTA 


503 


GSK3B 
GSK3B N 


M 002093 2 


1361339 2 


932 C 


-003010-05 C 
)-003010-06 C 


jAAGAAAGATGAGGTCTAT 
JGACCCAAATGTCAAACTA 


506 


GSK3B N 
GSK3B ^ 


M 002093 2 
M 002093 2 


1361339 2 
1361339 2 


932 C 
932 C 


)-003010-07 C 


SAAATGAACCGAAACTACA 


507 


GSK3B h 


JM 002093 2 


1361339 2 


932 |D-003010-08 C 


3ATGAGGTCTATG 1 1 aA 1 G 




IGF1 

IGF1 ^ 


«1M 000618 




[ 


3-003011-05 ( 
D-003011-06 / 


3GAAGTACATTTGAAGAAC 
\GAAGGAAGTAGATTTGAA 


509 


IGF1 r 
IGF1 


m 000618 
vlM 000618 




I 


D-003011-07 


DCTCAAGCCTGCCAAGTCA 


511 


1GF1 


vJM 000618 






3-003011-08 


3GTGGATGCTCTTCAGTTC 


512 


IGF1R 
1GF1R 
IGF1R 


^IM 000875 
NM 000875 


1 lUDOUU^ 

11068002 


3480 


D-003012-05 
D-0030 12-06 


CAACGAAGGTTCTGTGATG 
GGGCAGAAATGGAGAATAA 
GAAGCACCCTTTAAGAATG 


513 
514 
515 


1GF1R 
IGF1R 
1NPP5D 


NM 000875 
NM 000875 


11068002 
11068002 


3480 
3480 


D-003012-07 
D-003012-08 


GCAGACAGCTACAACATGA 
GGAATTGCGTTTACACTTA 


516 
517 


1NPP5D 
INPP5D 


NM 005541 
NM 005541 


5031798 
5031798 


3635 
3635 


D-003013-05 
D-003013-06 


GGAAACTGATCATTAAGAA 


518 


INPP5D 
INPP5D 
INSR 
INSR 


NM 005541 
NM 005541 

NM 000208 


5031798 
5031798 

4557883 


3635 
3643 


D-0030 1 3-07 
D-003013-08 

D-003014-05 


CGACAGGGATGAAGTACAA 
AAACGCAGCTGCCCATCTA 

GGAAGACGTTTGAGGAl lA 


519 
520 

521 


INSR 
INSR 
INSR 
iRS1 


NM 000208 
NM 000208 
NM 000208 


4557883 
4557883 
4557883 


3643 

— 
3643 — 


D-003014-06 
D-0030 1 4-07 
D-003014-08 


GAACAAGGCTCCCGAGAGT 
GGAGAGACCTTGGAAATTG 
GGAGGGAACCCACGTAl 1 i 

AAAGAGGTCTG6CAAGTGA 


522 
523 
524 

525 


IRS1 
IRS1 


NM 005544 
NM 005544 


5031804 
5031804 


3667-i 


D-00301 5-05 
D-003015-06 


GAACCTGATTGGTATCTAC 


526 


!RS1 


NM 005544 


5031804 


3667 


D-00301 5-07 


CCACGGCGATCTAGTGCTT 




IRS1 
MLLT7 


NM 005544 


5031804 


3667 


D-00301 5-08 
D-00301 6-05 


GTCAGTCTGTCGTCCAGTA 

GGACTGGACTTGAAC 1 1 IG 


528 
529 


MLLT7 
MLLT7 
MLLT7 


NM 005938 
NM 005938 
NM 005938 


5174578 
5174578 
5174578 


4303 
4303 
4303 


D-003016-06 
D-00301 6-07 
D-003016-06 


CCACGAAGCAGTTGAAAXG 
GAGAAGCGAGTGACACTTG 
GACCAGAGATCGCTAACCA 


531 
532 


MLLT7 
PDPK1 


NM 005938 


5174578 


4303 








PDPK1 
PDPK1 


NM 002613 
NM 002613 


4505694 


5170 
5170 


D-00301 7-0£ 
D-00301 7-OE 
D-00301 7-0' 


) CAAGAGACCTCGTGGAGAA 
> GACCAGAGGCCAAGAATTT 
r GGAAACGAGTATCTTATAT 


533 
534 
535 


PDPK1 
PDPK1 
PIK3CA 


NM 002613 
NM 002613 


4505694 
4505694 


5170 
5170 


D-00301 7-0 


i GAGAAGCGACATATCATAA 


536 


PIK3CA 


NM 006218 


5453891 


5290 


D-00301 8-0 


5 GCTATCATCTGAACAATTA 


537 


P1K3GA 


NM 006218 


5453891 


5290 


D-00301 8-0 


5 GGATAGAGGCCAAATAATA 


538 


PIK3CA 


NM 006218 


5453891 


5290 


D-00301 8-0 


7 GGACAACTGTTTCATATAG 


53G 


PIK3CA 
PIK3CB 


NM 006218 


5453891 


5290 


D-00301 8-0 
D-00301 9-0 


8 GCCAGTACCTCATGGATTA 
5 CGACAAGACTGCCGAGAG/ 


54C 
^ 541 


PiK3CB 
PIK3CB 
PIK3CB 
PIK3CB 
PIK3R1 


NM 006219 
NM 006219 
NM 006219 
NM 006219 


5453893 
5453893 
5453893 
5453893 


5291 
5291 
5291 
5291 


D-00301 9-0 
D-00301 9-0 
D-00301 9-0 


6 TCAAGTGTCTCCTAATATG 

7 GGATTCAGTTGGAGTGATT 

8 TTTCAAGTGTCTCCTAATA 


54: 
54. 
54 



ins 
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PIK3R1 


NM 181504 


32455251 


5295 


D-003020-05 


GGAAATATGGCTTCTCTGA 


545 


PIK3R1 


NM 181504 


32455251 


5295 


D-003020-06 


GAAAGACGAGAGACCAATA 


546 


PIK3R1 


NM 181504 


32455251 


5295 


D-003020-07 


GTAAAGCATTGTGTCATAA 


547 


PIK3R1 


NM 181504 


32455251 


5295 


D-003020-08 


GGATGAAGTTGTCAAAGAA 


548 


PIK3R2 














PIK3R2 


NM 005027 


4826907 


5296 


D-003021-05 


GGAAAGGCGGGAACAATAA 


54S 


PIK3R2 


NM 005027 


4826907 


5296 


D-003021-06 


GATGAAGCGTACTGCAATT 


550 


P1K3R2 


NM 005027 


4826907 


5296 


D-003021-07 


GGACAGCGAATCTCACTAC 


551 


P1K3R2 


NM 005027 


4826907 


5296 


D-003021-08 


GGAAGATCCGAGACCAGTA 


552 


PPP2R2B 














PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-05 


GAATGCAGCTTAGTTTCTT 


552 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-06 


GACCGAAGCTGACATTATC 


554 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-07 


TCGATTAGGTGAAGAGTTT 


55E 


PPP2R2B 


NM 004576 


4758953 


5521 


D-003022-08 


CCTGAAGAGTTTAGAAATA 


556 


PTEN 














PTEN 


NM 000314 


4506248 


5728 


D-003023-05 


GTGAAGATCTTGACCAATG 


557 


PTEN 


NM 000314 


4506248 


5728 


D-003023-06 


GATGAGCATACACAAATTA 


55£ 


PTEN 


NM 000314 


4506248 


5728 


D-003023-07 


GGCGCTATGTGTATTATTA 


55£ 


PTEN 


NM 000314 


4506248 


5728 


D-003023-08 


GTATAGAGCGTGCAGATAA 


56C 


RPS6 














RPS6 


NM 001010 


17158043 


6194 


D-003024-05 


GCCAGAAACTCATTGAAGT 


561 


RPS6 


NM 001010 


17158043 


6194 


D-003024-06 


GGATATTCCTGGACTGACT 




RPS6 


NM 001010 


17158043 


6194 


D-003024-07 


CCAAGGAGAACTGGAGAAA 




RPS6 


NM 001010 


17158043 


6194 


D-003024-08 


GCGTATGGCCACAGAAGTT 


564 


RPS6KA1 














RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-05 


GATGACACCTTCTACTTTG 


56E 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-06 


GAGAATGGGCTCCTCATGA 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-07 


CAAGCGGGATCCTTCAGAA 


— H? 


RPS6KA1 


NM 002953 


20149546 


6195 


D-003025-08 


CCACCGGCCTGATGGAAGA 


566 


RPS6KA3 














RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-05 


GAAGGGAAGTTGTATCTTA 


56E 


RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-06 


GAAAGTATGTGTATGTAGT 


57C 


RPS6KA3 


NM 004586 


4759049 


6197. - 


•O-003026-07 


GGACAGCATCCAAACATTA 


571 


RPS6KA3 


NM 004586 


4759049 


6197 


D-003026-08 


GGAGGTGAATTGCTGGATA 


575 


SGK 














SGK 


NM 005627 


5032090 


6446 


D-003027-01 


TTAATGGTGGAGAGTTGTT 


572 


SGK 


NM 005627 


5032090 


6446 • 


D-003027-04 


ATTAACTGGGATGATCTCA 


57A 


SGK 


NM 005627 


25168262 


6446 


D-003027-05 


GAAGAAAGCAATGCTGAAA . 


57J 


SGK 


NM 005627 


25168262 


6446 


D-003027-06 


AAACACAGCTGAAATGTAC ' 


57€ 


TSC1 














TSC1 


NM 000368 


24475626 


7248 


D-003028-05 


GAAGATGGCTATTCTGTGT 


577 


TSC1 


NM 000368 


24475626 


7248 


D-003028-06 


TATGAAGGCTCGAGAGTTA 


57£ 


TSC1 


NM 000368 


24475626 


7248 


D-003028-07 


CGACACGGGTGATAACTGA 


57J 


TSC1 


NM 000368 


24475626 


7248 


D-003028-08 


CGGCTGATGTTGTTAAATA 


58C 


TSC2 














TSC2 


NM 000548 


10938006 


7249 


D-003029-05 


GCATTAATCTCTTACCATA 


581 


TSC2 


NM 000548 


10938006 


7249 


D-003029-06 


CCAATGTCCTCTTGTCTTT 


581 


TSG2 


NM 000548 


10938006 


7249 


D-003029-07 


GGAGACACATCACCTACTT 


58J 


TSC2 


NM 000548 


10938006 


7249 


D-003029-08 


TCACCAGGCTCATCAAGAA 


58^ 


XP01 














XP01 


NM 003400 


8051634 


7514 


D-003030-05 


GAAAGTCTCTGTCAAAATA 


58{ 


XP01 


NM 003400 


8051634 


7514 


D-003030-06 


GCAATAGGCTCCATTAGTG 


58{ 


XP01 


NM 003400 


8051634 


7514 


D-003030-07 


GGAACATGATCAACTTATA 


587 
58( 


XP01 


NM 003400 


8051634 


7514 


D-003030-08 


GGATACAGATTCCATAAAT 
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Table Vn 



Gene 


Acc# 


Gl 


LL 


Duplex # 


SeQuenc6 


r-— 


Name 










in 


ABL1 












— 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-05 


GGAAATCAGTGACATAGTG 


— 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-06 


GGTCCACACTGCAATGTTT 


— 1 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-07 


G AAG G AAATCAGTG AC AT A 


— 1 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-08 


TCACTGAGTTCATGACCTA 


— 1 


ABL2 












— - 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -05 




— 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -06 


G A r! n r A A ATTTP^T ATT A A 


— 1 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -07 


RTA ATA Arjr'PTAr* AftTr'TA 


— 1 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -08 


r^riAr^Tri A AnTTr*^^T/^TA a 


5 


ACK1 














ACK1 


NM 005781 


8922074 


10188 


D-0031 02-05 


AA APf^PA AnTr*r2T/^r^ ATr^ A 


5 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-06 


f^PAAr5Tr*f2Tr*r^ATr^ A^TA A 


5 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-07 


AnPAPTAr^PTr* A/"» A ATr^ A 


5 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-08 


TCAGCAGCACCCACTA I I A 


6 


ALK 














ALK 


NM 004304 


29029631 


238 


D-0031 03-05 


riAPAAriATOf^T^r^AO A ATA 

omumaLjA I OL. 1 oUAoAA 1 A 


6 


ALK 


NM 004304 


29029631 




D-0031 03-06 


GGAAGAGTGTGGCAGTTGA 


6 


ALK 


NM 004304 


29029631 


238 


D-0031 03-07 


GCACGTGGCTCGGGACATT 


6 


ALK 


NM 004304 


29029631 


— 238^ 


D-0031 03-08 


GAACTGCAGTGAAGGAACA 


6 


AXL 














AXL 


NM 021913 


21536465 


— 558 


D-0031 04-05 


oo 1 OAoAoO 1 ouAouA 1 1 1 


6 


AXL 


NM 021913 


21536465 


— 558 


D-0031 04-06 


GAAAGAAGGAGACCCGTTA 


6 


AXL 


NM 021913 


21536465 


— 558 


D-0031 04-07 


CCAAGAAGATCTACAATGG 


6 


AXL 


NM 021913 


21536465 


558 


D-0031 04-08 


G GAACTGCATGCTGAATG A 


6 


BLK 














BLK 


NM 001715 


4502412 


640 


D-0031 05-05 


P Ar;f^ATnpr»Tr^/^Tr*o a i i i 


6 


BLK 


NM 001715 


450241 2 


640 


D^031 05-06 


ACATGAAGGTGGGCA 1 1 AA 


6 


BLK 


NM 001715 


450241 2 


640 


D-0031 05-07 


GGTCAGC6CCCAAGACAAG 


6 


BLK 


NM 001715 


450241 2 


640 


D-0031 05-08 


GAAACTCGGGTCTGGAGAA 


6 


BMX 














BMX 


NM 001721 


21359831 


660 


D-0031 06-05 


A A AP A A APr*TTT/^r^T A r^T A 
AAMl_.AAA(^L. 1 1 1 V^O 1 AO 1 A 


6 


-BMX 


NM 001721 


21359831 


660 


D-0031 06-06 


GAAGGAGCAl 1 1 ATGG 1 1 A : 




BMX 


NM 001721 


21359831 




D-0031 06-07 


GAGAAGAGATTACC rTGTT 


6 


BMX 


NM 001721 


21359831 


660 


D-0031 06-08 


GTAAGGCTGTGAATGATAA 


6 


BTK 














BTK 


NM 000061 


4557376 


— ^5 


D-0031 07-05 


T 

GAACAGGAATGGAAGCTTA 


6 


BTK 


NM 000061 


4557376 


695 


D-0031 07-06 


f^OTATPr^OPTrf^r*^^ A A A 1 ■ 1 

1 A 1 ootjV->HjOOAAAI 1 1 


6 


BTK 


NM 000061 


4557376 




D-0031 07-07 


G AAAG C AACTTACC ATGGT 


6 


BTK 


NM 000061 


4557376 


695 


D-0031 07-08 


GGTAAACGATCAAGGAGTT 


6: 


C20orf64 














C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-05 


CAACTTAGCCAAGACAATT 


6 


C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-06 


GAAATTGAAGGCTCAGTGA 


6 


C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-07 


TGGAACAGCTGAACATTGT 


6: 


C20orf64 


NM 033550 


19923655 


11285 


D-0031 08-08 


GCTTCGAACTGCTTATATA 


6: 


CSF1R 














CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-05 


GGAGAGCTCTGACGTTTGA 


6; 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-06 


CAACAACGCTACCTTCCAA 


6. 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-07 


CCAGGCAGCTGCCTTACAA 


6: 


CSF1R 


NM 005211 


27262658 


1436 


D-0031 09-08 


GGAACAACCTGCAGTTTGG 


6: 


CSK 
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CSK 


NM 004383 


4758077 


1445 


D-0031 10-05 


CAGAATGTATTGCCAAGTA 


6 


CSK 


NM 004383 


4758077 


1445 


D-0031 10-06 


GAACAAAGTCGCCGTCAAG 


6 


CSK 


NM 004383 


4758077 


1445 


D-0031 10-07 


GCGAGTGCCTTATCCAAGA 


6 


CSK 


NM 004383 


4758077 


1445 


D-0031 10-08 


GGAGAAGGGCTACAAGATG 


6 


DDR1 














DDR1 


NM 013994 


7669484 


780 


D-0031 11-05 


GGAGATGGAGTTTGAGTTT 


6 


DDR1 


NM 013994 


7669484 


780 


D-0031 11-06 


CAGAGGCCCTGTCATCTTT 


6 


DDR1 


NM 013994 


7669484 


780 


D-0031 11 -07 


GCTGGTAGCTGTCAAGATC 


6 


DDR1 


NM 013994 


7669484 


780 


D-0031 11-08 


TGAAAGAGGTGAAGATCAT 


6 


DDR2 














DDR2 


NM 006182 


5453813 


4921 


D-0031 12-05 


GGTAAGAACTACACAATCA 


6 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-06 


GAACGAGAGTGCCACCAAT 


6 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-07 


ACACCAATCTGAAGTTTAT 


6 




INIVI UUoloZ 


545381 3 


4921 


D-0031 1 2-08 


CAACAAGAATGCCAGGAAT 


_6 


DKFZp761 
P1010 














DKFZp761 
P1010 


NM_018423 


8922178 



55359 


D-0031 13-05 


CCTAGAAGCTGCCATTAAA 


"~6 


P1010 


NM_01 8423 


8922178 


55359 


D-0031 1 3-06 


GATTAGGCCTGGCTTA.TGA 


6 


DKFZp761 
P1010 


NM_018423 


8922178 


55359 


D-0031 13-07 


CCCAGTAGCTGCACACATA 


6 


DKFZp761 
P1010 


NM_0ia423 


8922178 


55359 


D-0031 13-08 


GGTGGTAGCTGAACTGTAT 


6 


EGFR 














EGFR 


NM 005228 


4885198 


1956 


D-0031 14-05 


GAAGGAAACTGAATTCAAA 


6 


EGFR 


NM 005228 


4885198 


1956 


D-0031 14-06 


GGAAATATGTACTACGAAA 


6 


EGFR 


NM 005228 


4885198 


1956 


D-0031 14-07 


CCACAAAGCAGTGAATTTA 


6 


EGFR 


NM 005228 


4885198 


1956 


D-0031 14-08 


GTAACAAGCTCACGCAGfr 


6 


EPHA1 














EPHA1 


NM 005232 


4885208 


2041 


D-0031 15-05 


GACGAGAGCTTCACCATTC 


6 


EPHA1 


NM 005232 


4885208 


2041 


D-0031 1 5-06 


GCAAGACTGTGGCCATTAA 


6 


EPHA1 


NM 005232 


4885208 


2041 


D-003T15-07 


GGGCGAACCTGACCTATGA 


6 


EPHA1 


NM 005232 


4885208 


2041 


D-0031 15-08 


GATTGTAGCCGTCATGTTT 


6 


EPHA2 














EPHA2 


NM 004431 


4758277 


1969 


D-0031. 16-05 


GGAGGGATGTGGCAACTTG 


6 


EPHA2 


NM 004431 


4758277 


1969 


D-0031 16-06 


GCAGCAAGGTGCACGAATT 


6 


E-PHA^ 


NM 004431 


4758277 


1969 


D-0031 16-07 


GGAGAAGGATGGCGAGTTC- 




EPHA2 


NM 004431 


4758277 


1969 


D-0031 16-08 


GAAGTTCACTACCGAGATC 


6 


EPHA3 














EPHA3 


NM 005233 


21361240 


2042 


D-0031 17-05 


GATCGGACCTCCAGAAATA 


6 


EPHA3 


NM 005233 


21361240 


2042 


D-003117-06 


GAACTCAGCTCAGAAGATT 


6 


EPHA3 


NM 005233 


21361240 


2042 


D-003117-07 


GCAAGAGGCACAAATGTTA 


6 


EPHA3 


NM 005233 


21361240 


2042 


D-0031 17-08 


GAGCATCAGTrTACAAAGA 


6 


EPHA4 














EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-05 


GGTGTGGGATGAAGTATTT 


6 


EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-06 


GAATGAAGTTACCTTATTG 


6 


EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-07 


GAACTTGGGTGGATAGCAA 


6 


EPHA4 


NM 004438 


4758279 


2043 


D-0031 18-08. 


GAGATTAAATTCACCTTGA 


6 


EPHA7 














EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-05 


GAAAAGAGATGTTGCAGTA 


6 


EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-06 


CTAGATGCCTCCTGTATTA 


6 


EPHA7 


NM 004440 


4758281 


2045 


D-0031 19-07 


AGAAGAAGGTTATCGTTTA 


6 


EPHA7 


NM 004440 


4758281 


2045 


D-003119-08 


TAGCAAAGCTGACCAAGAA 


6 


EPHA8 














EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-05 


GAAGATGCACTATCAGAAT 


"~6 
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EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-06 


GAGAAGATGCACTATCAGA 




EPHA8 


NM 020526 


18201903 


2046 


D-003 120-07 


AACGTGATCTCCAGTGTGA 




EPHA8 


NM 020526 


18201903 


2046 


D-0031 20-08 


TCTCAGACCTGGGCTATGT 




EPHB1 














EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -05 


GCGATAAGCTCCAGCATTA 




EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -06 


GAAACGGGGTTATAGCAAA 




EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -07 


GGATGAAGATCTACATTGA 




EPHB1 


NM 004441 


21396502 


2047 


D-0031 21 -08 


GCACGTCTCTGTCAACATG 




EPHB2 














EPHB2 


NM 017449 


17975764 


2048 


D-0031 22-05 


ACTATGAGCTGCAGTACTA 




EPHB2 


NM 017449 


17975764 


2048 


D-0031 22-06 


GTACAACGCCACAGCCATA 




EPHB2 


NM 017449 


17975764 


2048 


D-0031 22-07 


GGAAAGGAATGACTGTTCT 




EPHB2 


NM 017449 


17975764 


2048 


D-0031 22-08 


CGGACAAGGTGCAACACTA 




EPHB3 














EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-05 


GGTGTGATCTCGAATGTGA 




EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-06 


GGGATGACCTCCTGTACAA 




EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-07 


CAGAAGACCTGCTCCGTAT 




EPHB3 


NM 004443 


17975767 


2049 


D-0031 23-08 


GAGATGAAGTACTTTGAGA 




EPHB4 














EPHB4 


NM 004444 


17975769 


2050 


D-0031 24-05 


GGACAAACACGGACAGTAT 




EPHB4 


NM 004444 


17975769 


2050 


D-0031 24-06 


GTACTAAGGTCTACATCGA 




EPHB4 


NM 004444 


17975769 


2050 


D-0031 24-07 


GGAGAGAAGCAGAATATTC 




EPHB4 


NM 004444 


17975769 


2050 


D-G031 24-08 


GCCAATAGCCAGTCTAACA 




EPHB6 














EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-05 


GGAAGTCGATCGTGCTTAT 




EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-06 


GGACGAAGGTGGACACAAT 




EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-07 


TGTGGGAAGTGATGAGTTA 




EPHB6 


NM 004445 


4758291 


2051 


D-0031 25-08 


CGGGAGACCTTCACCGTTT 




ERBB2 














ERBB2 


NM 004448 


4758297 


2064 


D-0031 26-05 


GGACGAATTCTGCAGAATG 




ERBB2 


NM 004448 


4758297 


2064 


D-0031 26-06 


GACGAATTCTGCACAATGG 




ERBB2 


NM 004448 


4758297 


2064 


D-0031 26-07 


CTACAACACAGACACGTTT 




ERBB2 


NM 004448 


4758297 


2064 


D-0031 26.-081'^ 


AGACGAAGCATACGTGATG 




ERBB3 














ERBB3 


NM 001982 


4503596 


2065 


D-0031 27-05 


AAGAGGATGTCAACGGTTA 




ERBB3 


NM 001982 


4503596 


2065 


D-0031 27-06 


GAAGACTGCCAGACATTGA 




ERBB3 


NM 001982 


4503596 


2065 


D-0031 27-07 


GACAAACACTGGTGCTGAT 




ERBB3,. 


NM 001982 


4503596 


2065 


D-0031 27-08 


GCAGTGGATTCGAGAAGTG 




ERBB4 














ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-05 


GAGGAAAGATGCGAATTAA 




ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-06 


GCAGGAAACATCTATATTA 




ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-07 


GATCACAACTGCTGCTTAA 




ERBB4 


NM 005235 


4885214 


2066 


D-0031 28-08 


CCTCAAAGATACCTAGTTA 




FER 














PER 


NM 005246 


4885230 


2241 


D-0031 29-05 


GGAGTGACCTGAAGAATTC 




FER 


NM 005246 


4885230 


2241 


D-0031 29-06 


TAAAGCAGATTCCGATTAA 




FER 


NM 005246 


4885230 


2241 


D-0031 29-07 


GGAAAGTACTGTGCAAATG 




FER 


NM 005246 


4885230 


2241 


D-0031 29-08 


GAACAAGGGCTGGTAAAGA 




FES 














FES 


NM 002005 


13376997 


2242 


D-0031 30-05 


CGAGGATCCTGAAGGAGTA - 




FES 


NM 002005 


13376997 


2242 


D-0031 30-06 


AGGAATACCTGGAGATTAG 




FES 


NM 002005 


13376997 


2242 


D-0031 30-07 


CAACAGGAGCTCGGGAATG 




FES 


NM 002005 


13376997 


2242 


D-0031 30-08 


GGTGTTGGGTGAGCAGATT 




FGFR1 














FGFR1 


NM 000604 


13186232 


2260 


D-0031 31-05 


TAAGAAATGTCTCCTTTGA 




FGFR1 INM 000604 


13186232 


2260 


D-0031 31 -06 


GAAGACTGCTGGAGTTAAT 
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FGFR1 


NM 000604 


13186232 


2260 


D-003131-07 


GATGGTCCGTTGTATGTCA 


7 


FGFR1 


NM 000604 


13186232 


2260 


D-003131-08 


CTTAAGAAATGTCTGCTTT 


7 


FGFR2 














FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-05 


CCAAATCTCTCAAGCAGAA 


7 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-06 


GAACAGTATTCACGTAGTT 


7 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-07 


GGCCAACACTGTCAAGTTT 


7 


FGFR2 


NM 000141 


13186239 


2263 


D-0031 32-08 


GTGAAGATGTTGAAAGATG 


7 


FGFR3 














FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-05 


TGTCGGACCTGGTGTCTGA 


7 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-06 


GCATCAAGCTGCGGCATCA 


7 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-07 


GGACGGCACACCGTACGTT 


7 


FGFR3 


NM 000142 


13112046 


2261 


D-0031 33-08 


TGCACAACCTCGACTACTA 


7 


FGFR4 














FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-05 


GCACTGGAGTCTCGTGATG 


7 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-06 


CATAGGGACCTCTCGAATA 


7 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-07 


ATACGGACATCATCCTGTA 


7 


FGFR4 


NM 002011 


13112051 


2264 


D-0031 34-08 


ATAGGGACCTCTCGAATAG 


7 


FGR 














FGR 


NM 005248 


4885234 


2268 


D-0031 35-05 


GCGATCATGTGAAGCATTA 


7 


FGR 


NM 005248 


4885234 


2268 


D-0031 35-06 


TCACTGAGCTCATCACCAA 


7 


FGR 


NM 005248 


4885234 


2268 


D-0031 35-07 


GAAGAGTGGTACTTTGGAA 




FGR 


NM 005248 


4885234 


2268 


D-0031 35-08 


CCGAGAAGCTGCCCTCTTT 


7 


FLT1 














FLT1 


NM 002019 


4503748 


2321 


D-0031 36-05 


GAGCAAACGTGACTTATTT 


7 


FLT1 


NM 002019 


4503748 


2321 


D-0031 36-06 


CCAAATGGGTTTCATGTTA 




FLT1 


NM 002019 


4503748 


2321 


D-0031 36-07 


CAACAAGGATGCAGCACTA 




FLT1 


NM 002019 


4503748 


2321 


D-0031 36-08 


GGACGTAACTGAAGAGGAT 


7 


FLT3 














FLT3 


NM 004119 


4758395 


2322 


D-0031 37-05 


GAAGGCATCTACACCATTA 




FLT3 


NM 004119 


4758395 


2322 


D-0031 37-06 


GAAGGAGTCTGGAATAGAA 




FLT3 


NM 004119 


4758395 


2322 


D-0031 37-07 


GAATTTAAGTCGTGTGTTC 




FLT3 


NM 004119 


4758395 


2322 


D-0031 37-08 


GGAATTCATTTCACTCTGA 




FLT4 














FLT4 


NM 002020 


4503752 


2324 


D-0031 38-05 


GCAAGAACGTGCATCTGTT 




FLT4 


NM 002020 


4503752 


2324 


D-0031 38-06 


GCGAATACCTGTCCTACGA 




FLT4 


NM 002020 


4503752 • 


2324 


D-0031 38-07 


GAAGACATTTGAGGAATTC 


-1 


FLT4 


NM 002020 


4503752 


2324 


D-0031 38-08 


GAGCAGGCATrCATCAACA 




FRK 














FRK ^ 


NM 002031 


4503786 


2444 


D-0031 39-05 


GAAACAGACTCTTCATATT 


- 


FRK 


NM 002031 


4503786 


2444 


D-0031 39-06 


GAACAATACCACTCCAGTA 


- 


FRK 


NM 002031 


4503786 


2444 


D-0031 39-07 


CAAGACCGGTTCCTTTCTA 




FRK 


NM 002031 


4503786 


2444 


D-0031 39-08 


GCAAGAATATGTCGAAAAT 


- 


FYN 














FYN 


NM 002037 


23510344 


2534 


D-0031 40-05 


GGAATGGACTGATATGCAA 


- 


FYN 


NM 002037 


23510344 


2534 


D-0031 40-06 


GCAGAAGAGTGGTACTTTG 


- 


FYN 


NM 002037 


23510344 


2534 


D-0031 40-07 


CAAAGGAAGTTTACTGGAT 




FYN 


NM 002037 


23510344 


2534 


D-0031 40-08 


GAAGAGTGGTACTTTGGAA 




HCK 














HCK 


NM 002110 


4504356 


3055 


D-003141-05 


GAGATACCGTGAAACATTA 




HCK 


NM 002110 


4504356 


3055 


D-0031 41 -06 


GCAGGGAGATACCGTGAAA 




HCK 


NM 002110 


4504356 


3055 


D-0031 41 -07 


CATCGTGGTTGCCCTGTAT 




HCK 


NM 002110 


4504356 


3055 


D-0031 41 -08 


TGTGTAAGATTGCTGACTT 




ITK 














ITK 


NM 005546 


21614549 


3702 


D-0031 44-05 


CAAATAATCTGGAAACCTA 




ITK 


NM 005546 


21614549 


3702 


D-0031 44-06 


GAAGAAACGAGGAATAATA 




ITK 


NM 005546 


21614549 


3702 


D-0031 44-07 


GAAACTCTCTCATCCCAAA 
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ITK 
JAK1 


NM 005546 


21614549 


3702 


D-0031 44-08 


GGAATGGGCATGAAGGATA 


7 


JAK1 


NM 002227 


4504802 


3716 


D-0031 45-05 


CCACATAGCTGATCTGAAA 


7 


JAK1 


NM 002227 


4504802 


3716 


D-0031 45-06 


TGAAATCACTCACATTGTA 


7 


JAK1 


NM 002227 


4504802 


3716 


D-0031 45-07 


TAAGGAACCTCTATCATGA 


7 


JAK1 


NM 002227 


4504802 


3716 


D-0031 45-08 


GCAGGTGGCTGTTAAATCT 


7 


JAK2 














JAK2 


NM 004972 


13325062 


3717 


D-0031 46-05 


GCAAATAGATCCAGTTCTT 


7 


JAK2 


NM 004972 


13325062 


3717 


D-0031 46-06 


GAGGAAAGATCCAAGACTA 


7 


JAK2 


NM 004972 


13325062 


3717 


D-003146-07 


GCCAGAAACTTGAAACTTA 


7 


JAK2 


NM 004972 


13325062 


3717 


D-0031 46-08 


GTACAGATTTCGCAGATTT 


7 


JAK3 














JAK3 


NM 000215 


4557680 


3718 


D-0031 47-05 


GCGCCTATCTTTCTCCTTT 


7 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-06 


CCAGAAATCGTAGACATTA 


7 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-07 


CCTCATCTCTTCAGACTAT 


7 
7 


JAK3 


NM 000215 


4557680 


3718 


D-0031 47-08 


TGTACGAGCTCTTCACCTA 


KDR 














KDR 


NM 002253 


11321596 


3791 


D-0031 48-05 


GGAAATCTCTTGCAAGCTA 


7 


KDR 


NM 002253 


11321596 


3791 


D-0031 48-06 


GATTACAGATCTCCATTTA 


7 


KDR 


NM 002253 


11321596 


3791 


D-0031 48-07 


GCAGACAGATCTACGTTTG 


7 


KDR 


NM 002253 


11321596 


3791 


D-0031 48-08 


GCGATGGCCTGTTCTGTAA 


7 


KIAA1079 














KIAA1079 


NM 014916 


7662475 


22853 


D-0031 49-05 


GAAATTCTCTCAACTGATG 


7 


KIAA1079 


NM 014916 


7662475 


22853 


D-0031 49-06 


GCAG AG GTCTTCACACTTT 


7 


K1AA1079 


NM 014916 


7662475 


22853 


D-0031 49-07 


TAAATGATCTTCAGACAGA 


7 


KIAA1079 


NM 014916 


7662475 


22853 


D-0031 49-08 


GAGCAGCCCTACTCTGATA 


7 


KIT 














KIT 


NM 000222 


4557694 


3815 


D-0031 50-05 


AAACACGGCTTAAGCAATT 


7 


KIT 


NM 000222 


4557694 


3815 


D-0031 50-06 


GAACAGAACCTTCACTGAT 


7 


KIT 


NM 000222 


4557694 


3815 


D-0031 50-07 


GGGAAGCCCTCATGTCTGA 


7 


KIT 


NM 000222 


4557694 


3815 


D-0031 50-08 


GCAATTCCATTTATGTGTT 


7 


LCK 














LCK 


NM 005356 


20428651 


3932 


D-0031 51 -05 


GAAeTGCCATTATCCCATA 


7 
7 


LCK 


NM 005356 


20428651 


3932 


D-0031 51 -06 


GAGAGGTGGTGAAACATTA 


LCK 


NM 005356 


20428651 


3932 


D-0031 51 -07 


GGGCCAAGTTTCCCATTAA 


7 


LCK 


NM 005356 


20428651 


3932 


D-0031 51 -08 


GCACGCTGCTCATCCGAAA 


7 


LTK 














LTK 


NM 002344 


4505044 


4058 


D-0031 52-05 - 


TGAATTCACTCCTGCCAAT 


7 


LTK 


NM" 002344 


4505044 


4058 


D-0031 52-06 


GTGGCAACCTCAACACTGA 


7 


LTK 


NM 002344 


4505044 


4058 


D-0031 52-07 


GGAGCTAGCTGTGGATAAC 


7 


LTK 


NM 002344 


4505044 


4058 


D-0031 52-08 


GCAAGTTTCGCCATCAGAA 


7 


LYN 














LYN 


NM 002350 


4505054 


4067 


D-0031 53-05 


GCAGATGGCTTGTGCAGAA 


7 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-06 


GGAGAAGGCTTGTATTAGT 


7 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-07 


GATGAGCTCTATGACATTA 


7 


LYN 


NM 002350 


4505054 


4067 


D-0031 53-08 


GGTGCTAAGTTCCCTATTA 


7 


MATK 














MATK 


NM 002378 


21450841 


4145 


D-0031 54-05 


TGAAGAATATCAAGTGTGA 


7 


MATK 


NM 002378 


21450841 


4145 


D-0031 54-06 


CCGCTCAGCTCCTGCAGTT 


7 




NM 002378 


21450841 


4145 


D-0031 54-07 


TACTGAACCTGCAGCATTT 


7 


MATK 


NM 002378 


21450841 


4145 


D-0031 54-08 


TGGGAGGTCTTCTCATATG 


8 


MERTK 














MERTK 


NM 006343 


5453737 


10461 


D-0031 55-05 


GAACTTACCTTACATAGCT 


8 


MERTK 


NM 006343 


5453737 


10461 


D-0031 55-06 


GGACCTGCATACTTACTTA 


8 


MERTK 


NM 006343 


5453737 


10461 


D-0031 55-07 


TGACAGGAATCTTCTAATT 


8 


MERTK 


NM 006343 


5463737 


10461 


D-0031 55-08 


GGTAATGGCTCAGTCATGA 


8 
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MET 














MET 


NM 000245 


4557746 


4233 


D-0031 56-05 


GAAAGAACCTCTCAACATT 


8 


MET 


NM 000245 


4557746 


4233 


D-0031 56-06 


GGACAAGGCTGACCATATG 


8 


MET 


NM 000245 


4557746 


4233 


D-0031 56-07 . 


CCAATGACCTGGTGAAATT 


8 


MET 


NM 000245 


4557746 


4233 


D-0031 56-08 


GAGCATACATTAAACCAAA 


8 


MST1R 














MST1R 


NM 002447 


4505264 


4486 


D-0031 57-05 


GGATGGAGCTGCTGGCTTT 


8 


MST1R 


NM 002447 


4505264 


4486 


D-0031 57-06 


CTGCAGACCTATAGATTTA 


8 


MST1R 


NM 002447 


4505264 


4486 


D-0031 57-07 


GCACGTGTCTCACTCTTGA 


8 


MST1R 


NM 002447 


4505264 


4486 


D-0031 57-08 


GAAAGAGTCCATCCAGCTA 


8 


MUSK 














MUSK 


NM 005592 


5031926 


4593 


D-0031 58-05 


GAAGAAGCCTCGGGAGATA 


8 


MUSK 


NM 005592 


5031926 


4593 


D-0031 58-06 


GTAATAATGTCCATCATGT 


8 


MUSK 


NM 005592 


5031926 


4593 


D-0031 58-07 


GGAATGAACTGAAAGTAGT 


8 


MUSK 


NM 005592 


5031926 


4593 


D-0031 58-08 


GAGATTTCCTGGACTAGAA 


8 


NTRK1 














NTRK1 


NM 002529 


458571 1 


4914 


D-0031 59-05 


GGACAACCCTTTCGAGTTC 


8 


NTRK1 


NM 002529 


4585711 


4914 


D-0031 59-06 


CCAGTGACGTCAACAGGAA 


8 


NTRK1 


NM 002529 


4585711 


4914 


D-0031 59-07 


CCACAATACTTCAGTGATG 


8 


NTRK1 


NM 002529 


4585711 


4914 


D-0031 59-08 


GAAGAGTGGTCTCCGTTTC 


8 


NTRK2 














NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-05 


GAACAGAAGTAATGAAATC 


8 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-06 


GTAATGCTGTTTCTGCTTA 


8 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-07 


GCAAGACACTCCAAGTTTG 


8 


NTRK2 


NM 006180 


21361305 


4915 


D-0031 60-08 


GAAAGTCTATCACATTATC 


8 


NTRK3 














NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -05 


GAGCGAATCTGCTAGTGAA 


8 


NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -06 


GAAGTTCACTACAGAGAGT 


8 


NTRK3 


NM 002530 


4505474 


4916 


D-0031 61 -07 


GGTCGACGGTCGAAATTTG 


8 


NTRK3 


NM 002530 


4505474 


4916 


D-003161-08 


GAATATCACTTCCATACAC 


8 


PDGFRA 














PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-05 


GAAACTTCCTGGACTATTT 


8 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-06 


GAGAT^lArGGTCAACTATTT 


8 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-07 


GGACGCCGGTTCCTGATAT 


8 


PDGFRA 


NM 006206 


15451787 


5156 


D-0031 62-08 


CATCAGAGCTGGATCTAGA 


8 


PDGFRB 














PDGFRB 


NM 002609 


15451788 


5159 


D-0031 63-05 


GAAAGGAGACGTCAAATAT 


8 


PDGFRB 


NM -0D2609 


15451788 


5159 


D-0031 63-06 


GGAATGAGGTGGTCAACTT 


8 


PDGFRB 


NM -002609 


15451788 


5159 


D-0031 63-07 


CAACGAGTCTCCAGTGCTA 


8 


PDGFRB 


NM 002609 


15451788 


5159 


D-0031 63-08 


GAGAGGACCTGCCGAGCAA 


8 


PTK2 














PTK2 


NM 005607 


27886592 


5747 


D-0031 64-05 


GAAGTTGGGTTGTCTAGAA 


8 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-06 


GAAGAACAATGATGTAATC 


8 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-07 


GGAAATTGCTTTGAAGTTG 


8 


PTK2 


NM 005607 


27886592 


5747 


D-0031 64-08 


GGTTCAAGCTGGATTATTT 


8 


PTK2B 














PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-05 


GAACATGGCTGACCTCATA 


8 


PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-06 


GGACCACGCTGCTCTATTT 


8 


PTK2B 


NM 004103 


27886583 


2185 


D-0031 65-07 


GGACGAGGACTATTACAAA 


8 


PTK2B - 


NM 004103 


27886583 


2185 


D-0031 65-08 


TGGCAGAGGTCATCAACAA 


8 


PTK6 














PTK6 


NM 005975 


27886594 


5753 


D-0031 66-05 


GAGAAAGTCCTGCCCGTTT 


8 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-06 


TGAAGAAGCTGCGGCACAA 


8 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-07 


CCGGGACTCTGATGAGAAA 


8 


PTK6 


NM 005975 


27886594 


5753 


D-0031 66-08 


TGCCCGAGCTTGTGAACTA 


8 


PTK7 
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PTK7 


NM 002821 


27886610 


5754 


D-0031 67-05 


GAGAGAAGCCCACTATTAA 


8 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-06 


CGAGAGAAGCCCACTATTA 


8 


PTK7 


NM 002821 


2788661 0 


5754 


D-0031 67-07 


GGAGGGAGTTGGAGATGTT 


8 


PTK7 


NM 002821 


27886610 


5754 


D-0031 67-08 


GAAGACATGCCGCTATTTG 


8 


PTK9 














PTK9 


NM 002822 


4506274 


5756 


D-0031 68-05 


GAAGAACTACGACAGATTA 


8 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-09 


GAAGGAGACTATTTAGAGT 


8 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-10 


GAGCGGATGCTGTATTCTA 


8 


PTK9 


NM 002822 


4506274 


5756 


D-0031 68-11 


CTGCAGACTTCCTTTATGA 


8 


PTK9L 














PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-05 


AGAGAGAGCTCCAGCAGAT 


8 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-06 


TTAACGAGGTGAAGAGAGA 


8 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-07 


ACACAGAGCCCACGGATGT 


8 


PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-08 


GCTGGGATCAGGACTATGA 


8 


RET 














RET 


NM 000323 


21536316 


5979 


D-0031 70-05 


GCAAAGACCTGGAGAAGAT 


8 


RET 


NM 000323 


21536316 


5979 


D-0031 70-06 


GCACACGGCTGCATGAGAA 


S 


RET 


NM 000323 


21536316 


5979 


D-0031 70-07 


GAACTGGCCTGGAGAGAGT 


8 


RET 


NM 000323 


21536316 


5979 


D-0031 70-08 


TTAAATGGATGGCAATTGA 


8 


R0R1 














ROR1 


NM 005012 


4826867 


4919 


D-0031 71-05 


GCAAGCATGTTTAGTAGGA 


e 


ROR1 


NM 005012 


4826867 


4919 


D-0031 71-06 


GAGCAAGGCTAAAGAGCTA 


£ 


R0R1 


NM 005012 


4826867 


4919 


D-0031 71-07 


GAGAGCAACTTCATGTAAA 


€ 


R0R1 


NM 005012 


4826867 


4919 


D-0031 71-08 


GAGAATGTCCTGTGTCAAA 




R0R2 














R0R2 


NM 004560 


19743897 


4920 


D-0031 72-05 


GGAACTCGCTGCTGCCTAT 


8 


R0R2 


NM 004560 


19743897 


4920 


D-0031 72-06 


GCAGGTGCCTCCTCAGATG 




R0R2 


NM 004560 


19743897 


4920 


D-0031 72-07 


GCAATGTGCTAGTGTACGA 




R0R2 


NM 004560 


19743897 


4920 


D-0031 72-08 


GAAGACAGAATATGGTTCA 




R0S1 














R0S1 


NM 002944 


19924164 


6098 


D-0031 73-05 


GAGGAGACCTTCTTACTTA 




R0S1 


NM 002944 


19924164 


6098 


D-0031 73-06 


TTACAGAGGTTCAGGATTA 




R0S1 


NM 002944 


19924164 


6098 


D-0031 73-07 


GAACAAAC©TAAGCATGAA 




R0S1 


NM 002944 


19924164 


6098 


D-0031 73-08 


GAAAGAGCACTTCAAATAA 




RYK 














RYK 


NM 002958 


11863158 


6259 


D-0031 74-05 


GAAAGATGGTTACCGAATA 




RYK 


NM 002958 


11863158 


6259 


D-0031 74-06 


CAAAGTAGATTCTGAAGTT 




RYK 


NM 00295a. 


11863158 


6259 


D-0031 74-07 


TCACTACGCTCTATCGTTT 




RYK 


NM 002958 


11863158 


6259 


D-0031 74-08 


GGTGAAGGATATAGCAATA 




SRC 














SRC 


NM 005417 


21361210 


6714 


D-0031 75-05 


GAGAACCTGGTGTGCAAAG 




SRC 


NM 005417 


21361210 


6714 


D-0031 75-09 


GAGAGAACCTGGTGTGCAA 




SRC 


NM 005417 


21361210 


6714 


D-0031 75-10 


GGAGTTTGCTGGACTTTCT 




SRC 


NM 005417 


21361210 


6714 


D-0031 75-11 


GAAAGTGAGACCACGAAAG 




SYK 














SYK 


NM 003177 


21361552 


6850 


D-0031 76-05 


GGAATAATCTCAAGAATCA 




SYK 


NM 003177 


21361552 


6850 


D-0031 76^6 


GAACTGGGCTCTGGTAATT 




SYK 


NM 003177 


21361552 


6850 


D-0031 76-07 


GGAAGAATCTGAGCAAATT 




SYK 


NM 003177 


21361552 


6850 


D-0031 76-08 


GAACAGACATGTCAAGGAT 




TEC 














TEC 


NM 003215 


4507428 


7006 


D-0031 77-05 


GAAATTGTCTAGTAAGTGA 




TEC 


NM 003215 


4507428 


7006 


D-0031 77-06 


CACCTGAAGTGTTTAATTA 




TEC 


NM 003215 


4507428 


7006 


D-0031 77-07 


GTACAAAGTCGCAATCAAA 




TEC 


NM 003215 


4507428 


7006 


D-0031 77-08 


TGGAGGAGATTCTTATTAA 




TEK 














TEK 


NM 000459 


4557868 


7010 


D-0031 78-05 


GAAAGAATATGCCTCCAAA 


( 
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TEK 


^IM 000459 


4557868 


7010_ 


D-0031 78-06 


GGAATGACATCAAATTTCA 


8 


TEK 


MM 000459 


4557868 


7010 


D-0031 78-07 


TGAAGTACCTGATATTCTA 


8 


TEK 


MM 000459 


4557868 


7010 


D-0031 78-08 


CGAAAGACCTACGTGAATA 


8 


TIE 














TIE 


NM 005424 


4885630 


7075 


D-0031 79-05 


GAGAGGAGGTTTATGTGAA 


8 


TIE 


NM 005424 


4885630 


7075 


D-0031 79-06 


GGGACAGCCTCTACCCTTA 


8 


TIE 


NM 005424 


4885630 


7075 


D-0031 79-07 


GAAGTTCTGTGCAAATTGG 


8 


TIE 


NM 005424 


4885630 


7075 


D-0031 79-08 


CAAGATGGGCTCAGAAGTG 


9 


TNK1 














TNK1 


NM 003985 


4507610 


8711 


D-0031 80-05 


GTTGTGGGCGTAAGTCTAA 


9 


TNK1 


NM 003985 


4507610 


8711 


D-0031 80-06 


GAACTGGGTGTACAAGATC 


9 


TNK1 


NM 003985 


4507610 


8711 


D-0031 80-07 


CGAGAGGTATGGGTGATGA 




TNK1 


NM 003985 


4507610 


8711 


D-0031 80-08 


GGGGGATCGTGGAGCATTA 


s 


TXK 














TXK 


NM 003328 


4507742 


7294 


D-0031 81 -05 


GAACATCTATTGAGACAAG 




TXK 


NM 003328 


4507742 


7294 


D-0031 81 -06 


TGAAGGCACTTTATGATTT 




TXK 


NM 003328 


4507742 


7294 


D-0031 81 -07 


GGAGAGGAATGGCTATATT 




TXK 


NM 003328 


4507742 


7294 


D-0031 81 -08 


GGATATATGTGAAGGAATG 


£ 


TYK2 














TYK2 


NM 003331 


4507748 . 


7297 


D-0031 82-05 


GAGGAGATCCACCACTTTA 




TYK2 


NM 003331 


4507748 


7297 


D-0031 82-06 


GCATCCAGATTGCACATAA 




TYK2 


NM 003331 


4507748 


7297 


D-0031 82-07 


TCAAATAGCTAGCCACACT 


£ 


TYK2 


NM 003331 


4507748 


7297 


D-0031 82-08 


CAATCTTGCTGACGTCTTG 


£ 


TYR03 














TYROS 


NM 006293 


27597077 


7301 


D-0031 83-05 


GGTAGAAGGTGTGCCATTT 


£ 


TYR03 


NM 006293 


27597077 


7301 


D-0031 83-06 


ACGCTGAGATTTACAACTA 


£ 


TYROS 


NM 006293 


27597077 


7301 


D-0031 83-07 


GGATGGCTCCTTTGTGAAA 


£ 


TYROS 


NM 006293 


27597077 


7301 


D-0031 83-08 


GAGAGGAACTACGAAGATC 


£ 


YES1 














YES1 


NM 005433 


21071041 


7525 


D-0031 84-05 


GAAGGACCCTGATGAAAGA 


£ 


YES1 


NM 005433 


21071041 


7525 


D-0031 84-06 


TAAGAAGGGTGAAAGATTT 


£ 


YES1 


NM 005433 


21071041 


7525 


D-003184-07 


TCAAGAAGCTCAGATAATG 


£ 


YES1 


NM 005433 


21071041 


7525 


D-003184-08 


CAGAATGCCTGCATGAATT 


i 



Table VIII 

5 



Gene 
Name 


Acc# 


" Gl 


Locus 
Link 


Duplex # 


Full Sequence 


SEQ. I[ 
NO. 


APC2 














APC2 


NM 013366 


7549800 


29882 


D-003200-05 


GCAAGGACCTCTTCATGAA 


921 


APC2 


NM 013366 


7549800 


29882 


D-003200-06 


GAGAAGAAGTCCACACTAT 


922 


APC2 


NM 013366 


7549800 


29882 


D-003200-07 


GGAATGCCATCTCCCAATG 


923 


APC2 


NM 013366 


7549800 


29882 


D-003200-09 


CAACACGTGTGACATGATC 


924 


ATM 














ATM 


NM 000051 


20336202 


472 


D-003201-05 


GCAAGCAGCTGAAAGAAAT 


925 


ATM 


NM 000051 


20336202 


472 


D-003201-06 


GAATGTTGCTTTCTGAATT 


926 


ATM 


NM 000051 


20336202 


472 


D-003201-07 


GACCTGAAGTCTTATTTAA 


927 


ATM 


NM 000051 


20336202 


472 


D-003201-08 


AGACAGAATTCCCAAATAA 


928 


ATR 














ATR 


NM 001184 


20143978 


545 


D-003202-05 


GAACAACACTGGTGGTTTG 


929 


ATR 


NM 001184 


20143978 


545 


D-003202-06 


GAAGTCATCTGTTCATTAT 


930 


ATR 


NM 001184 


20143978 


545 


D-003202-07 


GAAATAAGGTAGACTCAAT 


931 


ATR 


NM 001184 


20143978 


545 


D-003202-08 


CAACATAAATCGAAGAAGA 


932 


BTAK 
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BTAK 


NM 003600 


3213196 


6790 


D-003545-04 


CAAAGAATCAGCTAGCAAA 


933 


BTAK 


NM 003600 


3213196 


6790 


D-003203-05 


GAAGAGAGTTATTCATAGA 


934 


BTAK 


NM 003600 


3213196 


6790 


D-003203-07 


CAAATGCCCTGTCTTACTG 


935 


STK6 


NM 003600 


3213196 


6790 


D-003203-09 


TCTCGTGACTCAGCAAATT 


936 


CCNA1 














CCNA1 


NM 003914 


16306528 


8900 


D-003204-05 


GAACCTGGCTAAGTACGTA 


937 


CCNA1 


NM 003914 


16306528 


8900 


D-003204-06 


GCAGATCCATTCTTGAAAT 


938 


CCNA1 


NM 003914 


16306528 


8900 


D-003204-07 


TCACAAGAATCAGGTGTTA 


939 


CCNA1 


NM 003914 


16306528 


8900 


D-003204-08 


CATAAAGCGTAGCTTGATA 


940 


CCNA2 














CCNA2 


NM 001237 


16950653 


890 


D-003205-05 


GCTGTGAACTACATTGATA 


941 


CCNA2 


NM 001237 


16950653 


890 


D-003205-06 


GATGATACCTACACCAAGA 


942 


CCNA2 


NM 001237 


16950653 


890 


D-003205-07 


GCTGTTAGCCTCAAAGTTT 


943 


CCNA2 


NM 001237 


16950653 


890 


D-003205-08 


AAGCTGGCCTGAATCATTA 


944 


CCNB1 














CCNB1 


NM 031966 


14327895 


891 


D-003206-05 


CAACATTACCTGTCATATA 


945 


CCNB1 


NM 031966 


14327895 


891 


D-003206-06 


CCAAATACCTGATGGAACT 


946 . 


CCNB1 


NM 031966 


14327895 


891 


D-003206-07 


GAAATGTACCCTCCAGAAA 


947 


CCNB1 


NM 031966 


14327895 


891 


D-003206-08 


GCACCTGGCTAAGAATGTA 


948 


CCNB2 














CCNB2 


NM 004701 


10938017 


9133 


D-003207-05 


CAACAAATGTCAACAAACA 


949 


CCNB2 


NM 004701 


10938017 


9133 


D-003207-06 


GCAGCAAAGTCCTGAAGAT 


950 


CCNB2 


NM 004701 


10938017 


9133 


D-003207-07 


CCAGTGATTTGGAGAATAT 


951 


CCNB2 


NM 004701 


10938017 


9133 


D-003207-08 


GTGACTACGTTAAGGATAT 


952 


CCNB3 














CCNB3 


NM 033031 


14719419 


85417 


D-003208-05 


TGAACAAACTGCTGACTTT 


953 


CCNB3 


NM 033031 


14719419 


85417 


D-003208-06 


GCTAGCTGCTGCCTCCTTA 


954 


CCNB3 


NM 033031 


14719419 


85417 


D-003208-07 


CAACTCACCTCGTGTGGAT 


955 


CCNB3 


NM 033031 


14719419 


85417 


D-003208-08 


GTGGATCTCTACGTAATGA 


956 


CCNC 














CCNC 


NM 005190 


7382485 


892 


D-003209-05 


GCAGAGCTCCCACTATTTG 


957 


CCNC 


NM 005190 


7382485 


892 


D-003209-06 


GGAGTAGTTTCAAATACAA 


958 ■ 


CCNC 


NM 005190 


7382485 


892 


D-003209-07 


GACCTTTGCTCCAGTATGJ 


959 


CCNC 


NM 005190 


7382485 


892 


D-003209-08 


GAGATTCTATGCCAGGTAT 


960 


CCND1 














CCND1 


NM 053056 


16950654 


595 


D-003210-05 


TGAACAAGCTCAAGTGGAA 


961 


CCND1 


NM 053056 


16950654 


595 


D-003210-06 


CCAGAGTGATCAAGTGTGA 


962 


CCND1 


NM 053056 


1695065iL 


595 


D-003210-07 


GTTCGTGGGGTGTAAGATG 


963 


CCND1 


NM 053056 


16950654 


595 


D-003210-08 


CCGAGAAGCTGTGCATCTA 


964 


CCND2 














CCND2 


NM 001759 


16950656 


894 


D-003211-06 


TGAATTACCTGGACCGTTT 


965 


CCND2 


NM 001759 


16950656 


894 


D-00321 1-07 


CGGAGAAGCTGTGCATTTA 


966 


CCND2 


NM 001759 


16950656 


894 


D-00321 1-08 


CTACAGACGTGCGGGATAT 


967 


CCND2 


NM 001759 


16950656 


894 


D-00321 1-09 


CAACACAGACGTGGATTGT 


968 


CCND3 














CCND3 


NM 001760 


16950657 


896 


D-00321 2-05 


GGACCTGGCTGCTGTGATT 


969 


CCND3 


NM 001760 


16950657 


896 


D-00321 2-06 


GATTATACCTTTGCCATGT 


970 


CCND3 


NM 001760 


16950657 


896 


D-00321 2-07 


GACCAGCACTCCTACAGAT 


971 


CCND3 


NM 001760 


16950657 


896 


D-00321 2-08 


TGCGGAAGATGCTGGCTTA 


972 
















CCNE1 


NM 001238 


17318558 


898 


D-00321 3-05 


GTACTGAGCTGGGCAAATA 


973 


CCNE1 


NM 001238 


17318558 


898 


D-00321 3-06 


GGAAATCTATCCTCCAAAG 


974 


CCNE1 


NM 001238 


17318558 


898 


D-00321 3-07 


GGAGGTGTGTGAAGTCTAT 


975 


CCNE1 


NM 001238 


17318558 


898 


D-00321 3-08 


CTAAATGACTTACATGAAG 


976 


CCNE2 














CCNE2 


NM 057749 


17318564 


9134 


D-00321 4-05 


GGATGGAACTCATTATATT 


977 



wo 2004/045543 PCT/US2003/036787 
116 



CCNE2 


NM 057749 


17318564 


9134 


D-003214-06 


GCAGATATGTTCATGACAA 


978 


CCNE2 


NM 057749 


17318564 


9134 


D-003214-07 


CATAATATCCAGACACATA 


979 


CCNE2 


NM 057749 


17318564 


9134 


D-003214-08 


TAAGAAAGCCTCAGGTTTG 


980 


CCNF 














CCNF 


NM 001761 


4502620 


899 


D-003215-05 


TCACAAAGCATCCATATTG 


981 


CCNF 


NM 001761 


4502620 


899 


D-003215-06 


GAAGTCATGTTTACAGTGT 


982 


CCNF 


NM 001761 


4502620 


899 


D-003215-07 


TAGCCTACCTCTACAATGA 


983 


CCNF 


NM 001761 


4502620 


899 


D-003215-08 


GCACCCGGTTTATCAGTAA 


984 


CCNG1 














CCNG1 


NM 004060 


8670528 


900 


D-003216-05 


GATAATGGCCTCAGAATGA 


985 


CCNG1 


NM 004060 


8670528 


900 


D-003216-06 


GCAGGGCAATTGAAGCATA 


986 


CCNG1 


NM 004060 


8670528 


900 


D-003216-07 


GGAATAGAATGTCTTCAGA 


987 


CCNG1 


NM 004060 


8670528 


900 


D-003216-08 


TAACTCACCTTCCAACAAT 


988 


CCNG2 














CCNG2 


NM 004354 


4757935 


901 


D-003217-05 


GGAGAGAGTTGGTTTCTAA 


989 


CCNG2 


NM 004354 


4757935 


901 


D-003217-06 


GGTGAAACGTAAACATTTG 


990 


CCNG2 


NM 004354 


4757935 


901 


D-003217-07 


GAAATACTGAGCCTTGATA 


991 


CCNG2 


NM 004354 


4757935 


901 


D-0032 17-08 


TGGCAAAGTrGAAGATTTA 


992 


CCNH 














CCNH 


NM 001239 


17738313 


902 


D-003218-05 


GCTGATGACTTTCTTAATA 


993 


CCNH 


NM 001239 


17738313 


902 


D-003218-06 


CAACTTAATTTCCACCTTA 


994 


CCNH 


NM 001239 


17738313 


902 


D-0032 18-07 


ATACACACCTTCCCAAATT 


995 


CCNH 


NM 001239 


17738313 


902 


D-003218-08 


GCTATGAAGATGATGATTA 


996 


CCNI 














CCNI 


NM 006835 


17738314 


10983 


D-003219-05 


GCAAGCAGACCTCTACTAA 


997 


CCNI 


NM 006835 


17738314 


10983 


D-003219-07 


TGAGAGAATTCCAGTACTA 


998 


CCNI 


NM 006835, 


17738314 


10983 


D-003219-08 


GGAATCAAACGGCTCTATA 


999 


CCNI 


NM 006835 


17738314 


10983 


D-003219-09 


GAATTGGGATCTTCACACA 


1000 


CCNT1 














CCNT1 


NM 001240 


17978465 


904 


D-003220-05 


TATCAACACTGCTATAGTA 


1001 


CCNT1 


NM 001240 


17978465 


904 


D-003220-06 


GAACAAACGTCCTGGTGAT 


1002 


CCNT1 


NM 001240 


17978465 


904 


D-003220-07 


GCACAAGACTCACCCATCT 


1003 


CCNT1 


NM 001240 


17978465 


904 


D-003220-08 


GCACAGACTTCTTACTTCA • 


r1004 


CCNT2A 














CCNT2A 


NM 001241 


1 7978467 


905 


D-003221-05 


GCACAGACATCCTATTTCA 


1005 


CCNT2A 


NM 001241 


17978467 


905 


D-003221-06 


GCAGGGACCTTCTATATCA 


1006 


CCNT2A 


NM 001241 


17978467 


905 


D-003221-07 


GAACAGCTATATTCACAGA 


1007 


CCNT2A 


NM 001241 


17978467. 


505 


D-003221-09 


TTATATAGCTGCCCAGGTA 


1008 


CCNT2B 














CCNT2B 


NM 058241 


17978468 


905 


D-003222-05 


GCACAGACATCCTATTTCA 


1009 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-06 


GCAGGGACCTTCTATATCA 


1010 


CCNT2B 


NM 058241 


17978468 


905 


D-00 3222-07 


GAACAGCTATATTCACAGA 


1011 


CCNT2B 


NM 058241 


17978468 


905 


D-003222-08 


GGTGAAATGTACCCAGTTA 


1012 


CDC16 














CDC16 


NM 003903 


14110370 


8881 


D-003223-O5 


GTAGATGGCTTGCAAGAGA 


1013 


CDC16 


NM 003903 


14110370 


8881 


D-003223-06 


TAAAGTAGCTTCACTCTCT 


1014 


CDC16 


NM 003903 


14110370 


8881 


D-003223-07 


GCTACAAGCTTACTTCTGT 


1015 


CDC16 


NM 003903 


14110370 


8881 


D-003223-08 


TGGAAGAGCCCATCAATAA 


1016 


CDC2 
















iNivi uooo/y 


27886643 


983 


D-003552-O1 


GTACAGATCTCCAGAAGTA 


1017 


CDC2 


NM 033379 


27886643 


983 


D-O03552-O2 


GATCAACTCTTCAGGATTT 


1018 


CDC2 


NM 033379 


27886643 


983 


D-003552-03 


GGTTATATCTCATCTTTGA 


1019 


CDC2 


NM 033379 


27886643 


983 


D-003552-04 


GAACTTCGTCATCCAAATA 


1020 


CDC20 














CDC20 


NM 001255 


4557436 


991 


D-003225-05 


GGGAATATATATCCTCTGT 


1021 


CDC20 


NM 001255 


4557436 


991 


D-003225-06 


GAAACGGCTTCGAAATATG 


1022 
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CDC20 


NM 001255 


4557436 


991 


D-003225-07 


GAAGACCTGCCGTTACATT 


1023 


CDC20 


NM 001255 


4557436 


991 


D-003225-08 


CACCAGTGATCGACACATT 


1024 


CDC25A 














CDC25A 


NM 001789 


4502704 


993 


D-003226-05 


GAAATTATGGCATCTGTTT 


1025 


CDC25A 


NM 001789 


4502704 


993 


D-003226-06 


TACAAGGAGTTCTTTATGA 


1026 


CDC25A 


NM 001789 


4502704 


993 


D-003226-07 


CCACGAGGACTTTAAAGAA 


1027 


CDC25A 


NM 001789 


4502704 


993 


D-O03226-08 


TGGGAAACATCAGGATTTA 


1028 


CDC25B 














CDC25B 


NM 004358 


11641416 


994 


D-003227-05 


GCAGATACCCCTATGAATA 


1029 


CDC25B 


NM 004358 


11641416 




D-003227-06 


CTAGGTCGCT^I ctctctga 


1030 


CDC25B 


NM 004358 


11641416 


994 


D-003227-07 


GAGAGCTGATTGGAGATTA 


1031 


CDC25B 


NM 004358 


11641416 


994 


D-003227-08 


AAAAGGACCTCGTCATGTA 


1032 


CDC25C 














CDC25C 


NM 001790 


12408659 


995 


D-003228-05 


GAGCAGAAGTGGCCTATAT 


1033 


CDC25C 


NM 001790 


12408659 


995 


D-003228-06 


CAGAAGAGATTTCAGATGA 


1034 


CDC25C 
CDC25C 


NM 001790 
NM 001790 


12408659 
12408659 


995 
995 


D-003228-07 


CCAGGGAGCCTTAAACTTA 


1035 


CDC27 








D-003228-08 


GAAACTTGGTGGACAGTGA 


1036 


CDC27 


NM 001256 


16554576 


996 


D-003229-06 


CATGCAAGCTGAAAGAATA 


1037 


CDC27 


NM 001256 


16554576 


996 


D-003229-07 


CAACACAAGTACCTAATCA 


1038 


CDC27 


NM 001256 


16554576 


996 


D-003229-08 


GGAGATGGATGCTATTTAC 


1039 


CDC27 


NM 001256 


16554576 


996 


D-003229-09 


GAAAAGCCATGATGATATT 


1040 


CDC34 














CDC34 


NM 004359 


16357476 


997 


D-003230-05 


GCTCAGACCTCTTCTACGA 


1041 


CDC34 


NM 004359 


16357476 


997 


D-003230-06 


GGACGAGGGCGATCTATAG 


1042 


CDC34 


NM 004359 


16357476 


997 


D-003230-07 


GATGGGGAGTACACAGACA 


1043 


CDC34 


NM 004359 


16357476 


997 


D-003230-08 


TGAACGAGCCCAACACCTT 


1044 


CDC37 














CDC37 


NM 007065 


16357476 


11140 


D-003231-05 


GCGAGGAGACAGGCAATTA 


1045 


CDC37 


NM 007065 


16357478 


11140 


D-003231-06 


CACAAGACCTTCGTGGAAA 


1046 


CDC37 


NM 007065 


16357478 


11140 


D-003231-07 


ACAATCGTCATGCAATTTA 


1047 


CDC37 


NM 007065 


16357478 


11140 


D-003231-08 


GAGGAGAAATGTGCACTCA 


1048 


CDC45L 














CDC45L 


NM 003504 


34335230 


8318 


D-003232-05 


GCACACGGATCTCCTTTGA 


1049 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-06 


GCAAACACCTGCTCAAGTC 


1050 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-07 


TGAAGAGTCTGCAAATAAA 


1051 


CDC45L 


NM 003504 


34335230 


8318 


D-003232-08 


GGACGTGGATGCTCTGTGT 


1052 


CDC6 














CDC6 


NM 001254 


16357469 


990' ' 


D-003233-05 


GAACAGAGCTGTCCCAGAT 


1053 


CDC6 


NM 001254 


16357469 


990 


D-003233-06 


GAGCAGAGATGTCCACTGA 


1054 


CDC6 


NM 001254 


16357469 


990 


D-003233-07 


GGAAATATCTTAGCTACTG 


1055 


CDC6 


NM 001254 


16357469 


990 


D-003233-08 


GGACGAAGATTGGTATTTG 


1056 


CDC7 














CDC7 


NM 003503 


11038647 


8317 


D-003234-05 


GGAATGAGGTACCTGATGA 


1057 


CDC7 


NM 003503 


11038647 


8317 


D-003234-06 


CAGGAAAGGTGTTCACAAA 


1058 


CDC7 


NM 003503 


11038647 


8317 


D-003234-07 


CTACACAAATGCACAAATT 


1059 


CDC7 


NM 003503 


11038647 


8317 


D-003234-08 


GTACGGGAATATATGCTTA 


1060 


CDK10 














CDK10 


NM 003674 


32528262 


8558 


D-003235-05 


GAACTGCTGTTGGGAACCA 


1061 




NM 003674 


32528262 


8558 


D-003235-06 


GGAAGCAGCCCTACAACAA 


1062 


CDK10 


NM 003674 


32528262 


8558 


D-003235-07 


GCACGCCCAGTGAGAACAT 


1063 


CDK10 


NM 003674 


32528262 


8558 


D-003235-08 


GGAAGCAGCCCTACAACAA 


1064 


CDK2 














CDK2 


NM 001798 


16936527 


1017 


D-003236-05 


GAGCTTAACCATCCTAATA 


1065 


CDK2 


NM 001798 


16936527 


1017 


D-003236-06 


GAGCTTAACCATCCTAATA 


1066 


CDK2 


NM 001798 


16936527 


1017 


D-003236-07 


GTACCGAGCTCCTGAAATC 


1067 
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CDK2 


NM 001798 


16936527 


1017 


D-003236-08 


GAGAGGTGGTGGCGCTTAA 


1068 


CDK3 














CDK3 


NM 001258 


4557438 


1018 


D-003237-05 


GAGCATTGGTTGCATGTTT 


1069 


CDK3 


NM 001258 


4557438 


1018 


D-003237-06 


GATCGGAGAGGGCACCTAT 


1070 


CDK3 


NM 001258 


4557438 


1018 


D-003237-07 


GAAGCTCTATCTGGTGTTT 


1071 


CDK3 


NM 001258 


4557438 


1018 


D-003237-08 


GCAGAGATGGTGACTCGAA 


1072 


CDK4 














CDK4 


NM 000075 


456426 


1019 


D-003238-05 


GCAGCACTCTTATCTACAT 


1073 


CDK4 


NM 000075 


456426 


1019 


D-003238-06 


GGAGGAGGCCTTCCCATCA 


1074 


CDK4 


NM 000075 


456426 


1019 


D-003238-07 


TCGAAAGCCTCTCTTCTGT 


1075 


CDK4 


NM 000075 


456426 


1019 


D-003238-08 


GTACCGAGCTCGCGAAGTT 


1076 


CDK5 














CDK5 


NM 004935 


4826674 


1020 


D-003239-05 


TGACCAAGCTGCCAGACTA 


1077 


CDK5 


NM 004935 


4826674 


1020 


D-003239-06 


GAGCTGAAATTGGCTGATT 


1078 


CDK5 


NM 004935 


4826674 


1020 


D-003239-07 


CAACATCCCTGGTGAACGT 


1079 


CDK5 


NM 004935 


4826674 


1020 


D-003239-08 


GGATrCCCGTCCGCTGTTA 


1080 


CDK6 














CDK6 


NM 001259 


16950658 


1021 


D-003240-05 


GCAAAGACCTACTTCTGAA 


1081 


CDK6 


NM 001259 


16950658 


1021 


D-003240-06 


GAAGAAGACTGGCCTAGAG 


1082 


CDK6 


NM 001259 


16950658 


1021 


D-003240-07 


GGTCTGGACTTTCTTCATT 


1083 


CDK6 


NM 001259 


16950658 


1021 


D-003240-08 


TAACA6ATATCGATGAACT 


1084 


CDK7 














CDK7 


NM 001799 


16950659 


1022 


D-003241-05 


GGACATAGATCAGAAGCTA 


1085 


CDK7 


NM 001799 


16950659 


1022 


D-003241-06 


CAATAGAGCTTATACACAT 


1086 


CDK7 


NM 001799 


16950659 


1022 


D-003241-07 


CATACAAGGCTTATTCTTA 


1087 


CDK7 


NM 001799 


16950659 


1022 


D-003241-08 


GGAGACGACTTACTAGATC 


1088 


CDK8 














CDK8 


NM 001260 


4502744 


1024 


D-003242-05 


CCACAGTACTCACATCAGA 


1089 


CDK8 


NM 001260 


4502744 


1024 


D-003242-06 


GCAATAACCAGAGTAATGG 


1090 


CDK8 


NM 001260 


4502744 


1024 


D-003242-07 


GAAGAAAGTGAGAGTTGTT 


1091 


CDK8 


NM 001260 


4502744 


1024 


D-003242-08 


GAACATGACCTCTGGCATA 


1092 


CDK9 














CDK9 


NM 001261 


17017983 


1025 


D-003243-05 


GGCCAAACGTGGACAACTA 


1093 ;.' 


CDK9 


NM 001261 


17017983 


1025 


D-003243-06 


TGACGTCCATGTTCGAGTA 


1094 


CDK9 


NM 001261 


17017983 


1025 


D-003243-07 


CCAACCAGACGGAGTTTGA 


1095 


CDK9 


NM 001261 


17017983 


1025 


D-003243-08 


GAAGGTGGCTCTGAAGAAG 


1096 


CDKN1C 












- 


CDKN1C 


NM 000076 


4557440 


1028 . 


D^.03244-05 


GACCAGAACGGCTGGGATT 


1097 


CDKN1C 


NM 000076 


4557440 


1028 


■D-003244-06 


GGACCGAAGTGGACAGCGA 


1098 


CDKN1C 


NM 000076 


4557440 


1028 


D-003244-08 


GCAAGAGATCAGCGCCTGA 


1099 


CDKN1C 


NM 000076 


4557440 


1028 


D-003244-09 


CCGCTGGGATTACGACTTC 


1100 


CDKN2B 














CDKN2B 


NM 004936 


17981693 


1030 


D-003245-05 


GCGAGGAGAACAAGGGCAT 


1101 


CDKN2B 


NM 004936 


17981693 


1030 


D-003245-06 


GCAACGGAGTCAAGCGTrT 


1102 


CDKN2B 


NM 004936 


17981693 


103O 


D-003245-07 


CGATCCAGGTCATGATGAT 


1103 


CDKN2B 


NM 004936 


17981693 


1030 


D-003245-08 


CCTGGAAGCCGGCGCGGAT 


1104 


CDKN2C 














CDKN2C 


NM 001262 


17981697 


1031 


D-003246-05 


GGACACCGCCTGTGATTTG 


1105 


CDKN2C 


NM 001262 


17981697 


1031 


D-003246-06 


GCCAGGAGACTGCTACTTA 


1106 


CDKN2C 


NM 001262 


17981697 


1031 


D-003246-07 


TGAAAGACCGAACTGGTTT 


1107 


CDKN2C 


NM 001262 


17981697 


1031 


D-003246-08 


GAACCTGCCCTTGCACTTG 


1108 


CDKN2D 














CDKN2D 


NM 001800 


17981700 


1032 


D-003247-05 


TGGCAGTTCAAGAGGGTCA 


1109 


CDKN2D 


NM 001800 


17981700 


1032 


D-003247-06 


CTCAGGACCTCGTGGACAT 


1110 


CDKN2D 


NM 001800 


17981700 


1032 


D-003247-07 


TGAAGGTCCTAGTGGAGCA 


1111 


CDKN2D 


NM 001 80Q 


17981700 


11032 


D-003247-08 


AGACGGCGCTGCAGGTCAT 


1112 
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CDT1 














CDT1 


NM 030928 


19923847 


81620 


D-003248-05 


CCAAGGAGGCACAGAAGCA 


1113 


CDT1 


NM 030928 


19923847 


81620 


D-003248-06 


GCTTCAACGTGGATGAAGT 


1114 


CDT1 


NM 030928 


19923847 


81620 


D-003248-07 


TCTCCGGGCCAGAAGATAA 


1115 


CDT1 


NM 030928 


19923847 


81620 


D-003248-08 


GCGCAATGTTGGCCAGATC 


1116 


CENPA 














CENPA 


NM 001809 


4585861 


1058 


D-003249-05 


GCACACACCTCTTGATAAG 


1117 


CENPA 


NM 001809 


4585861 


1058 


D-003249-06 


GCAAGAGAAATATGTGTTA 


1118 


CENPA 


NM 001809 


4585861 


1058 


D-003249-07 


TTACATGCAGGCCGAGTTA 


1119 


CENPA 


NM 001809 


4585861 


1058 


D-003249-08 


GAGACAAGGTTGGCTAAAG 


1120 


CENPB 














CENPB 


NM 001810 


26105977 


1059 


D-003250-05 


{Zf^AC^AXJ^c^or^finr'Tn i i i 

^JOMwM 1 MV3V*OOV»0 1 oL» 1 1 1 




CENPB 


NM 001810 


26105977 


1059 


D-003250-06 


GCACGATCCTGAAGAACAA 


1122 


CENPB 


NM 001810 


26105977 


1059 


D-003250-07 


GGAGGAGGGTGATGTTGAT 


1123 


CENPB 


NM 001810 


26105977 


1059 


D-003250-08 


CCGAATGGCTGCAGAGTCT 


1124 


CENPC1 














CENPC1 


NM 001812 


4502778 


1060 


D-003251-05J 


GCGAATAGATTATCAAGGA 


1125 


CENPC1 
CENPC1 


NM 001812 
NM 001812 


4502778 
4502778 


1060 
1060 


D-003251-06 


GAACAGAATCCATCACAAA 


1126 


CENPC1 


NM 001812 


4502778 


1060 


D-003251-07 
D-003251-08 


CCATAAACCTCACCCAGTA 
CAAGAGAACACGTTTGAAA 


1127 
1128 


CENPE 














CENPE 


NM 001813 


4502780 


1062 


D-003252-05 


GAAGACAGCTCAAATAATA 


1129 


CENPE 


NM 001813 


4502780 


1062 


D-003252-06 


CAACAAAG CTACTAAATCA 


1130 


CENPE 


NM 001813 


4502780 


1062 


D-003252-07 


GGAAAGAAGTGCTACCATA 


1131 


CENPE 


NM 001813 


4502780 


1062 


D-003252-08 


G6AAAGAAATGACACAGTT 


1132 


CENPF 














CENPF 


NM 016343 


14670380 


1063 


D-003253-05 


GCGAATATCTGAATTAGAA 


1133 


CENPF 


NM 016343 


14670380 


1063 


D-003253-06 


GGAAATTAATGCATCCTTA 


1134 


CENPF 


NM 016343 


14670380 


1C63 


D-003253-07 


GAGCGAGGCTGGTGGTTTA 


1135 


CENPF 


NM 016343 


14670380 


1063 


D-003253-08 


CAAGTCATCTTTCATCTAA 


1136 


CENPH 














CENPH 


NM 022909 


21264590 


64946 


D-003254-05 


GAAAGAAGAGATTGCAATT 


1137 


CENPH 


NM 022909 


21264590 


64946 


D-003254-06 


CAGAACAAATTATGCAAGA 


1138 


CENPH 


NM 022909 


21264590 


64946 


D-003254-07 


CTAGTGTGCTCATGGATAA 


1139 


CENPH 


NM 022909 


21264590 


64946 


D-003254-08 


GAAACACCTATTAGAGCTA 


1140 


CHEK1 














CHEK1 


NM 001274 


20127419 


1111 


D-003255-05 


CAAATTGGATGCAGACAAA 


1141 


CHEK1 


NM 001274 


20127419 


1111 


D-G03255-06 


GOAACAGTATTTCGGTATA 


1142 


CHEK1 


NM 001274 


20127419 


1111 


D-00'3255-07 


GGACTTCTCTCCAGTAAAC 


1143 


CHEK1 


NM 001274 


20127419 


1111 


D-003255-08 


AAAGATAGATGGTACAACA 


1144 


CHEK2 














CHEK2 


NM 007194 


22209010 


11200 


D-003256-02 


CTCTTACATTGCATACATA 


1145 


CHEK2 


NM 007194 


22209010 


11200 


D-003256-03 


TAAACGCCTGAAAGAAGCT 


1146 


CHEK2 


NM 007194 


22209010 


11200 


D-003256-04 


GCATAGGACTCAAGTGTCA 


1147 


CHEK2 


NM 007194 


22209010 


11200 


D-003256-05 


GAAATTGCACTGTCACTAA 


1148 


CNK 














CNK 


NM 004073 


4758015 


1263 


D-003257-05 


GCGAGAAGATCCTAAATGA 


1149 


CNK 


NM 004073 


4758015 


1263 


D-003257-07 


GCAAGTGGGTTGACTACTC 


1150 


CNK 


NM 004073 


4758015 


1263 


D-003257-08 


GCACATCCGTTGGCCATCA 


1151 


CNK 


NM 004073 


475801 5 




D-003257-09 


GACCTCAAGTTGGGAAATT 


1152 


CRM 














CRM 


NM 014335 


7656937 


23741 


D-003258-05 


GTGATGAGATTATTGATAG 


1153 


CRM 


NM 014335 


7656937 


23741 


D-003258-06 


GGACGAGGGCGAGGAATTT 


1154 


CRM 


NM 014335 


7656937 


23741 


D-003258-07 


GGAAACGGAGCCTTGCTAA 


1155 


CRM 


NM 014335 


7656937 


23741 


D-003258-G8 


TCAATCGTCTGACCGAAGA 


1156 


E2F1 
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E2F1 


NM 005225 


12669910 




u-uuo^oy-uo 




1157 


E2F1 


NM 005225 


12669910 




D-003259-06 


Tf^(^APPAOOT/^AXr^ A AT AT 
1 ooA»jL>AoO I oA 1 oAA 1 A 1 


1158 


E2F1 


NM 005225 


12669910 




n-nmo<;Q 07 


CCCAGGAGGTCACTTCTGA 


1159 


E2F1 


NM 005225 


12669910 


... 




u-uuo^oy-uo 


GGCTGGACCTGGAAACTGA 


1160 


E2F2 














E2F2 


NM 004091 


3448571 8 


■ 


n oo'ioftn^i^ 


■ — ■ 

GGGAGAAGACTCGGTATGA 


1161 


E2F2 


NM 004091 


3448571 8 




D-003260-06 


GAGGACAACCTGCAGATAT 


1162 


E2F2 


NM 004091 


3448571 8 




n no'^oftn 07 

LI-UUoZOU"U / 


TGAAGGAGCTGATGAACAC 


1163 


E2F2 


NM 004091 


3448571 8 





D-003260-08 


CCAAGAAGTTCATTTACCT 


1164 


E2F3 














E2F3 


NM 001949 


1 266991 3 




D-003261 -05 


GAAATTAGATGAACTGATC 


1165 


E2F3 


NM 001949 


12669913 


"T«7i — 


U-UU%5ilDl -UD 


TGAAGTGCCTGACTCAATA 


1166 


E2F3 


NM 001949 


1 266991 3 




D-003261 -07 


GAACAAGGCAGCAGAAGTG 


1167 


E2F3 


NM 001949 


12669913 


_2i — 


n oo'^ofti OP 

U-UUo^O 1 -Uo 


GAAACACACAGTCCAATGA 


1168 


E2F4 






— — — 








E2F4 


NM 001950 


12669914 




D-003262-05 


■ 

GGAGATTGCTGAGAAACTG 


1169 


E2F4 


NM 001950 


1 266991 4 




n oo^ofto Oft 


G AAG GTATCGGGCTAATCG 


1170 


E2F4 


NM 001950 


1 ^ooyy 1 H 




D-003262-07 


GTGCAGAAGTCCAGGGAAT 


1171 


eIfs — 


NM 001950 


1 ^Doyy 1 1 




D-003262-08 


GGAGAGTGGTGAGGTCAGT 


1172 








— — — 


— 






E2F5 

--- 


NM 001951 


1 ^DDyyn d 




D-003263-05 


GCAGATGACTAGAACTTTA 


1173 




INm UU loo 1 


1 <iDDyyi 0 


1|75_ 


D-003263-06 


GACATCAGCTACAGATATA 


1174 


P P 


iMM UU lyo 1 


1 266991 6 


1875_ 


D-003263-O7 


CAACATGTCTGTGAAAGAA 


1175 


E2F5 


viivi UU lyo 1 


12669916 


1875_ 


D-003263-08 


GAAGACATCTGTAATTGGT 


1176 


E2F6 







. _ 








"Hfr — 


iNivi UU lyo^ 


1 266991 7 


^876_ 


D-003264-05 


TAAACAAGGTTGCAACGAA 


1177 




InIVI UU ihfOZ 


1 266991 7 


^876_ 


D-003264-06 


TAGCATATGTGACCTATCA 


1178 


P 


inivi UU lyo^ 


1 266991 7 


J876_ 


D-003264-07 


GAAACCAGATTGGATGTTC 


1179 


FPFfi 

Fn^ 


iNivi uujyo^ 


1 266991 7 


J876_ 


D-003264-09 


GGAACTTTGTGACTTATCA 


1180 


ml 

















INM UUO^D^ 


r-r 

6552332 


2353_ 




D-003265-05 


GGGATAGCCTCTCTTACTA 


1181 


FOS 


NiVI UU0/£O^ 


6552332 




D-003265-06 


GAACAGTTATCTCCAGAAG 


1182 


FOS t 


NM 005252 


6552332 




D-003265-07 


GGAGACAGACCAACTAGAA 


1183 


— 


iNivi uuo^Oi^ 




2353_ 


D-003265-08 


AGACCGAGCGCTTTGATGA 


1184 



















.. J~ 


INIVI uzzf'fu 


1 3430859 


^^^^^ 


D-003266-06 


GAGAATGAGTCCAATCGAA 


1185 


"hipkI — 


l^lVI UZz / 'tU 


1 3430859 


^^^^^ 


D-003266-07 


AGAGAGGGATTAAGTGAAA 


1186 


hipkI — 


INIVI U^^/'fU 


1 3430859 


^^^^^ 


D-003266=Q8 


GGACAAAGAGAAGTAGGTT 


1187 




noo7/in 

INIVI U^^a4U 


1 3430859 


28996_ 


D-003266-09 


GCACACACGTGAAATGATG 


1188 


hiusi — 














HUS1 


NM 004507 


O lUr lo 






D-003267-05 


ACAAAGGGCTTATGCAATA 


1189 


HUS1 


NM 004507 


O 1 U/ / ^ 1 o 




D-003267-06 


GAAGTGCACATAGATATTA 


1190 


HUS1 


NM 004507 


O lU/ lo 


^^^^ 


D-003267-07 


AAGCTTAACTTCATGGTTT 


1191 


HUS1 


NM 004507 


^1 07701 '5 




D-003267-08 


GAACTTCTTCAACGAATTT 


1192 


JUN 
















NM 002228 


7710122 


^^^^ 


— 

D-003268-05 


TGGAAACGAGGTTCTATGA 


1193 


JUN 


NM 002228 


7710122 




D-003268-06 


GAACTGCACAGCCAGAACA 


1194 




NM 002228 


7710122 


^^^^ 


D-003268-07 


GAGCTGGAGGGGGTGATAA 


1195 


JUN 


NM 002228 


7710122 


3725 


D-003268-08 


TAAGGCAGCAGTTGGAAAC 


1196 


JUNB 














JUNE 


NM 002229 


4504808 


3726 


D-003269-05 


GCATGAAAGTGGAGGGGAA 


1197 


JUNB 


NM 002229 


4504808 


3726 


D-003269-06 


TGGAAGAGCAAGAGGGGAT 


1198 


JUNB 


NM 002229 


4504808 


3726 


D-003269-07 


GATACACAGCTACGGGATA 


1199 


JUNB 


NM 002229 


4504808 


3726 


D-003269-08 


CCATCAACATGGAAGACCA 


1200 


LOC510 
53 
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LOC510 
53 


NM_015895 


20127542 


51053 


D-003270-05 


GGAGAAAGGCGCTGTATGA 


1201 


LOC510 
53 


NM_015895 


20127542 


51053 


D-003270-06 


GAATAGTTCTGTCCCAAGA 


1202 


LOC510 
53 


NM_015895 


20127542 


51053 


D-003270-07 


GAACATGTACAGTATATGG 


1203 


LOC510 
53 


NM_015895 


20127542 


51053 


D-003270-08 


GCAGAAACAAGAAGAAATC 


1204 


MAD2L1 














MAD2L1 


NM 002358 


6466452 


4085 


D-003271-05 


GAAAGATGGCAGTTTGATA 


1205 


MAD2L1 


NM 002358 


6466452 


4085 


D-003271-06 


TAAATAATGTGGTGGAACA 


1206 


MAD2L1 


NM 002358 


6466452 


4085 


D-003271-07 


GAAATCCGTTCAGTGATGA 


1207 


MAD2L1 


NM 002358 


6466452 


4085 


D-003271-08 


TTACTCGAGTGCAGAAATA 


1208 


MAD2L2 














MAD2L2 


NM 006341 


6006019 


10459 


D-003272-05 


GGAAGAGCGCGCTCATAAA 


1209 


MAD2L2 


NM 006341 


6006019 


10459 


D-003272-06 


TGGAAGAGCGCGCTCATAA 


1210 


MAD2L2 


NM 006341 


6006019 


10459 


D-003272-07 


AGCCACTCCTGGAGAAGAA 


1211 


MAD2L2 


NM 006341 


6006019 


10459 


D-003272-08 


TGGAGAAATTCGTCTTTGA 


1212 


MCM2 














MCM2 


NM 004526 


33356546 


4171 ■ 


U-003273-05 


GAAGATCTTTGCCAGCATT 


1213 


MCM2 


NM 004526 


33356546 


4171 


D-003273-06 


GGATAAGGCTCGTCAGATC 


1214 


MCM2 


NM 004526 


33356546 


4171 


D-003273-07 


CAGAGCAGGTGAGATATGA 


1215 


MCM2 


NM 004526 


33356546 


4171 


D-003273-08 


GCCGTGGGCTCCTGTATGA 


1216 


MCM3 














MCM3 


NM 002388 


33356548 


4172 


D-003274-05 


GGACATCAATATTCTTCTA 


1217 


MCM3 


NM 002388 


33356548 


4172 


D-003274-06 


GCCAGGACATCTCCAGTTA 


1218 


MCM3 


NM 002388 


33356548 


4172 


D-003274-07 


GCAGGTATGACCAGTATAA 


1219 


MCM3 


NM 002388 


33356548 


4172 


D-003274-08 


GGAAATGCCTGAAGTACAG 


1220 


MCM4 














MCM4 


XM 030274 


22047061 


4173 


D-003275-05 


GGACATATCTATTCTTACT 


1221 


MCM4 


XM 030274 


22047061 


4173 


D-003275-06 


GATGTTAGTTCACCACTGA 


1222 


MCM4 


XM 030274 


22047061 


4173 


D-003275-07 


CCAGCTGCCTCATACTTTA 


1223 


MCM4 


XM 030274 


22047061 


4173 


D-003275-08 


GAAAGTACAAGATCGGTAT 


1224 


MCM5 














MCIV15 


NM 006739 


23510447 


4174 


D-003276-05 


GAAGATCCCTGGCATCATC 


1225 


MCM5 


NM 006739 


23510447 


4174 


D-003276-06 


GAACAGGGTTACCATCATG 


1226 


MCM5 


NM 006739 


23510447 


4174 


D-003276-07 


GGACAACATTGACTTCATG 


1227 


MCM5 


NM 006739 


23510447 


4174 


D-003276-08 


CCAAGGAGGTAGCTGATGA 


1228 


MCM6 








- 






MCM6 


NM 005915 


33469920 


4175 


D-003277-05 


GGAAAGAGCTCAGAGATGA 


1229 


Mcm 


NM 005915 


33469920 


4175 


D-003277-06 


GAGCAGCGATGGAGAAATT 


1230 


IVICM6 


NM 005915 


33469920 


4175 


D-003277-07 


GGAAACACCTGATGTCAAT 


1231 


MCM6 


NM 005915 


33469920 


4175 


D-003277-08 


CCAAACATCTGCCGAAATC 


1232 


MCM? 














MCM7 


NM 005916 


33469967 


4176 


D-003278-05 


GGAAATATCCCTCGTAGTA 


1233 


MCM7 


NM 005916 


33469967 


4176 


D-003278-06 


GGAAGAAGGAGTTCAAGTA 


1234 


MCM7 


NM 005916 


33469967 


4176 


D-003278-07 


CAACAAGCCTGGTGTGATC 


1235 


MCM7 


NM 005916 


33469967 


4176 


D-003278-08 


GGAGAGAACAGAAGGATTG 


1236 


MDM2 














MDM2 


NM 002392 


4505136 


4193 


D-003279-05 


GGAGATATGTTGTGAAAGA 


1237 


MDM2 


NM 002392 


*touo 1 oo 




D-003279-06 


CCACAAATGTGATAGTATT 


1238 


MDM2 


NM 002392 


4505136 


4193 


D-003279-07 


GATGAGGTATATGAAGTTA 


1239 


MDM2 


NM 002392 


4505136 


4193 


D-003279-08 


GGAAGAAACCCAAGACAAA 


1240 


MKI67 














MKI67 


NM 002417 


19923216 


4288 


D-003280-05 


GCAGAAAGCTTGGTTATAA 


1241 


MKI67 


NM 002417 


19923216 


4288 


D-003280-06 


GCTAAGACCTGAACTATTT 


1242 


MKI67 


NM 002417 


19923216 


4288 


D-003280-07 


CAAAGAGGAACACAAATTA 


1243 



wo 2004/045543 PCT/US2003/036787 
122 



MKI67 


NM 002417 


•IQQO'50'IC 

lyyicOiiio 




D-003280-08 


GTAAATGGGTCTGTTATTG 


1244 


MNAT1 















MNAT1 


NM 002431 






D-003281 -05 


GGAAGAAGCTTTAGAAGTG 


1245 


MNAT1 


NM 002431 






D-003281 -06 


TAGATGAGCTGGAGAGTTC 


1246 


MNAT1 


NM 002431 






D-003281 -07 


GGACCTTGCTGGAGGCTAT 


1247 


MNAT1 


NM 002431 






D-003281 -08 


GCAGATAGAGACATATGGA 


1248 


MYC 















MYC 


NM 002467 


O I04v3<il0 


^^^^ 


D-003282-05 


CAGAGAAGCTGGCCTCCTA 


1249 


MYC 


NM 002467 


31543215 


4609 


D-003282-06 


GAAACGACGAGAACAGTTG 


1250 


MYC 


NM 002467 


31 54321 5 


4609 


D-003282-07 


CGACGAGACCTTCATCAAA 


1251 


MYC 


INIVI UU^4D/ 


31543215 


4609 


D-003282-08 


CCACACATCAGCACAACTA 


1252 


ORC1L 




— 










0RC1L 


NM 004153 


31795543 


^^^^ 


D-003283-05 


GAACAGGAATTCCAAGACA 


1253 


ORC1L 


NM 0041 53 


o 1 /yoo4o 


^^^^ 


D-003283-06 


TAAGAAACGTGCTCGAGTA 


1254 


ORC1L 


NM 004153 


.3 1 / yoo4o 


^^^^ 


D-003283-07 


GAGATCACCTCACCTTCTA 


1255 


ORC1L 


NM 0041 53 


31795543 


'^^^ 


D-003283-08 


GCAGAGAGCCCTTCTTGGA 


1256 






— — — 












NM 0061 90 


32454751 


4999 


D-003284-05 


GAAGAAACCTCCTATGAGA 


1257 


Qp-g. — 


INIVI uvjo 1 yu 


32454751 


4999 


D-003284-06 


GAAGGGAACTGATGGAGTA 


1258 


ORC2i — 


INIVI UUDlaU 


32454751 


4999 


D-003284-07 


GAAGAATGATCCTGAGATT 


1259 


7^r5\ — 
■ipp:::" — 


iNivi uuDiyu 


32454751 


4999 


D-003284-08 


GAAGAGATGTTCAAGAATC 


1260 


-^P^^ — 


— 














NM 012381 


32483366 


23595 


D-003285-05 


GGACTGCTGTGTAGATATA 


1261 


QP — 


NM 012381 


32483366 


23595 


D-003285-06 


GAACTGATGACCATACTTG 


1262 


— 
-^p^ — 


NM 012381 


32483366 


23595 


D-003285-07 


AAAGATCTCTCTGCCAATA 


1263 




NM 012381 


32483366 


23595 


D-003285-08 


CAGCACAGCTAAGAGAATA 


1264 


ORrii — 




— 










orFIi — 

— 


INIVI uuzoo^ 


32454749 


5000 


D-003286-06 


GAAAGCACATTCCGTTTAT 


1265 




NM 002552 


32454749 


5000 


D-003286-07 


TGAAAGAACTCATGGAAAT 


1266 




NM 002552 


32454749 


5000 


D-O03286-O8 


GCTGAGAAGTGGAATGAAA 


1267 




NM 002552 


32454749 


5000 


D-003286-09 


CCAGTGATCTTCATATTAG 


1268 



















NM ;002553 


32454752 


5001 


D-003287-05 


GAAATAACCTGTGAAACAT 


1269 


cTRT^i — 

— 


NM 002553 


32454752 


5001 


D-003287-06 


CAGATTACCTCTCTAGTGA 


1270 




NM 002553 


32454752 


5001 


D-003287-07 


GAACTTCCATATTACTCTA 


1271 


■ — 


INm UU^OOo 


32454752 


5001 


D-003287-08 


GTATTCAGCTGATTTCTAT 


1272 


■^p- — 




. , 










0RC6L- 


INIVI u 140^:1 


32454755 


23594 


D-003288-05 


GMCATGGCTTCAAAGATA 


1273 




MM H'lA^O'X 
INIVI VJ l*fO^ 1 


32454755 


^■^^^^ 


D-003288-06 


GGACAGGGCTTATTTAATT 


1274 


0RC6L 


INIVI Ul4J^1 


32454755 


23594 


D-003288-07 


GAAAGAAGATAGTGGTTGA 


1275 


0RC6L 


INIVI U i*tOZ 1 


32454755 


23594 


D-003288-08 


TATCAGAGCTGTCTTAAAT 


1276 


PCNA 

















NM 002592 


33239449 


5111 


D-003289-05 


GATCGAGGATGAAGAAGGA 


1277 


rc^A — 


NM 002592 


33239449 


5111 


D-0D3289-07 


GCCGAGATCTCAGCCATAT 


1278 


Pf^NA 


INIVI uu^oy^ 


33239449 


5111 


D-003289-09 


GAGGCCTGCTGGGATATTA 


1279 


PCNA 


INIVI uu^oy^ 


33239449 


5111 


D-003289-10 


GTGGAGAACTTGGAAATGG 


1280 


PLK 




— — 










PLK 


NM 005030 


21359872 


5347 


D-003290-05 


CAACCAAAGTCGAATATGA 


1281 


PLK 


NM 005030 


21359872 


5347 


D-003290-06 


CAAGAAGAATGAATACAGT 


1282 


PLK 


NM 005030 


21359872 


5347 


D-003290-07 


GAAGATGTCCATGGAAATA 




PLK 


NM 005030 


21359872 


5347 


D-003290-08 


CAACACGCCTCATCCTCTA 


1284 


PIN1 














PIN1 


NM 006221 


5453897 


5300 


D-003291-05 


GGACCAAGGAGGAGGCCCT 


1285 


PIN1 


NM 006221 


5453897 


5300 


D-003291-06 


CGTCCTGGCGGCAGGAGAA 


1286 


P1N1 


NM 006221 


5453897 


5300 


D-003291-07 


CGGGAGAGGAGGACT7TGA 


1287 


PIN1 


NM 006221 


5453897 


5300 


D-003291-08 


AGTCGGGAGAGGAGGACTT 


1288 
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P1N1L 














PIN1L 


NM 006222 


5453899 


5301 


D-003292-06 


CGACCTGGCGGCAGGAAAT 


1289 


PIN1L 


NM 006222 


5453899 


5301 


D-003292-07 


AGGCAGGAGAGAAGGACTT 


1290 


PIN1L 


NM 006222 


5453899 


5301 


D-003292-08 


GCTACATCCAGAAGATCAA 


1291 


PIN1L 


NM 006222 


5453899 


5301 


D-003292-09 


GGACAGTGTTCACGGATTC 


1292 


RAD1 














RAD1 


NM 002853 


19718797 


5810 


D-003293-05 


GAAGATGGACAAATATGTT 


1293 


RAD1 


NM 002853 


19718797 


5810 


D-003293-06 


GGAAGAGTCTGTTACTTTT 


1294 


RAD1 


NM 002853 


19718797 


5810 


D-003293-07 


GATAACAGAGGCTTCCTTT 


1295 


RAD1 


NM 002853 


19718797 


5810 


D-003293-08 


GCATTAGTGCTATCTTGTA 


1296 


RAD17 














RAD17 


NM 133338 


19718783 


5884 


D-003294-05 


GAATCAAGCTTCCATATGT 


1297 


RAD17 


NM 133338 


19718783 


5884 


D-003294-06 


CAACAAAGCCCGAGGATAT 


1298 


RAD17 


NM 133338 


19718783 


5884 


D-003294-07 


ACAGATGCCTGGAGACTTA 


1299 


RAD17 


NM 133338 


19718783 


5884 


D-003294-08 


CTACATAGATTTCTTGATG 


1300 


RAD9A 














RAD9A 


NM 004584 


19924112 


5883 


D-003295-05 


TCAGCAAACTTGAATCTTA 


1301 


RAD9A 


NM 004584 


19924112 


5883 


D-003295-06 


GACATTGACTCTTACATGA 


1302 


RAD9A 


NM 004584 


19924112 


5883 


D-003295-08 


GGAAACCACTATAGGCAAT 


1303 


RAD9A 


NM 004584 


19924112 


5883 


D-003295-09 


CGGACGACTTTGCCAATGA 


1304 


RB1 














RB1 


NM 000321 


19924112 


5925 


D-003296-05 


GAAAGGACATGTGAACTTA 


1305 


RB1 


NM 000321 


19924112 


5925 


D-003296-06 


GAAGAAGTATGATGTATTG 


1306 


RB1 


NM 000321 


4506434 


5925 


D-003296-07 


GAAATGACTTCTACTCGAA 


1307 


RB1 


NM 000321 


4506434 


5925 


D-003296-08 


GGAGGGAACATCTATATTT 


1308 


RBBP2 














RBBP2 


NM 005056 


4826967 


5927 


D-003297-05 


CAAAGAAGCTGAATAAACT 


1309 


RBBP2 


NM 005056 


4826967 


5927 


D-003297-06 


CAACACATATGGCGGATTT 


1310 


RBBP2 


NM 005056 


4826967 


5927 


D-003297-07 


G6ACAAACCTAGAAAGAAG 


1311 


RBBP2 


NM 005056 


4826967 


5927 


D-003297-08 


GAAAGGCACTCTCTCTGTT 


1312 


RBL1 














RBL1 


NM 002895 


34577078 


5933 


D-003298-05 


CAAGAGAAGTTGTGGCATA 


1313 


RBL1 


NM 002895 


34577078 


5933 


D-003298-06 


CAGCAGCACTCCATTTATA 


1314 


RBL1 


NM 002895 


34577078 


5933 


D-003298-07 


ACAGAAAGGTGTATCATTT 


1315 


RBL1 


NM 002895 


34577078 


5933 


D-003298-08 


GGACATAAAGTTACAATTG 


1316 


RBL2 














RBL2 


NM 005611 


21361291 


5934 


D-003299-05 


GAGCAGAGCTTAATCGAAT 


1317 


RBL2 


NM 005611 


21361291 


5934 


D-003299-06 


GAGAAIAGCCCTTGTGTGA 


1318 


RBL2 


NM 005611 


21361291 


5934 


D-003299-07 


GGACTTAGTTTATGGAAAT 


1319 


RBL2 


NM 005611 


21361291 


5934 


D-O03299-O8 


GAATTTAGATGAGGGGATA 


1320 


RBP1 














RBP1 


NM 002899 


8400726 


5947 


D-003300-05 


GAGACAAGCTCCAGTGTGT 


1321 


RBP1 


NM 002899 


8400726 


5947 


D-003300-06 


GCAAGCAAGTATTCAAGAA 


1322 


RBP1 


NM 002899 


8400726 


5947 


D-003300-07 


GCAGGACGGTGAGGATATG 


1323 


RBP1 


NM 002899 


8400726 


5947 


D-003300-08 


GCAAGTGCATGACAACAGT 


1324 


RPA3 














RPA3 


NM 002947 


19923751 


6119 


D-003322-05 


GGAAGTGGTTGGAAGAGTA 


1325 


RPA3 


NM 002947 


19923751 


6119 


D-003322-06 


GAAGATAGCCATCCTTTTG 


1326 


RPA3 


NM 002947 


19923751 


6119 


D-003322-07 


CATGCTAGCTCAATTCATC 


1327 


RPA3 


NM 002947 


\Vt7£.Of 01 




D-003322-08 


GATGTTGGACTTTACAATG 


1328 


SKP1A 














SKP1A 


NM 006930 


25777710 


6500 


D-003323-05 


GGAGAGATATTTGAAGTTG 


1329 


SKP1A 


NM 006930 


25777710 


6500 


D-003323-06 


GGGAATGGATGATGAAGGA 


1330 


SKP1A 


NM 006930 


25777710 


6500 


D-003323-07 


CAAACAATCTGTGACTATT 


1331 


SKP1A 


NM 006930 


25777710 


6500 


D-003323-08 


TCAATTAAGTTGCAGAGTT 


1332 


SKP2 
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SKP2 


NM 005983 


1 6306594 


— 


T— 

D-003324-05 


CATCTAGACTTAAGTGATA 


1333 


SKP2 


iNivi uuoyoo 


1 6306594 


6502 


D-003324-06 


GAAATCAGATCTCTCTACT 


1334 


SKP2 


NM 005983 


1 DoUDOyn 


^^^^ 


D-003324-07 


CTAAAGGTCTCTGGTGTTT 


1335 


SKP2 


NM 005983 


16306594 




D-003324-08 


GATGGTACCCTTCAACTGT 


1336 


SNK 






— — — 









SNK 


NM 006622 


0 1 OUU04 




D-003325-05 


GAAGACATCTACAAGCTTA 


1337 


SNK 


NM 006622 


5730054 




D-003325-06 


GAAATAGCTTCATGAACAA 


1338 


SNK 


NM 006622 


Of OUU04 




D-003325-Q7 


GAAGGTCAATGGCTCATAT 


1339 
1340 


SNK 


NM 006622 


5730054 




D-003325-08 


CCGGAGATCTCGCGGATTA 


STK12 






— 








STK12 


NM 004217 




^^^^ 


D-003326-07 


CAGAAGAGCTGCACATTTG 


1341 


STK12 


NM 004217 






D-003326-08 


CCAAACTGGTCAGGCATAA 


1342 


STK12 


NM 004217 


47591 77 




D-003326-09 


ACGCGGCACTTCACAATTG 


1343 


STK12 


INIVI UU4^ 1 / 


4759177 


9212 


D-003326-10 


TGGGACACCCGACATCTTA 


1344 


TFDP1 















TFDP1 


NM 0071 1 1 


34147667 


^^^^ 


D-003327-05 


GGAAGCAGCTCTTGCCAAA 


1345 


TFDP1 


INIVl UUA 1 1 1 


34147667 


7027 


D-003327-06 


GAGGAGACTTGAAAGAATA 


1346 


TFDP1 


KIM nn7i ^ -1 

INIVI UU/ 1 1 1 


34147667 


7027 


D-003327-07 


GAACTTAGAGGTGGAAAGA 


1347 


TFDP1 


MM nnv-i-i-i 

INIVI UU/ 1 1 1 


34147667 


7027 


D-003327-08 


GCGAGAAGGTGCAGAGGAA 


1348 


TFDP2 















TFDP2 


INIVI UUO^Ou 


54541 1 1 


7029 


D-003328-05 


GAAAGTGTGTGAGAAAGTT 


1349 




INIVI UUDiiOD 


54541 1 1 


7029 


D-003328-06 


CACAGGACCTTCTTGGTTA 


1350 


TFDP2 


NM 006286 


54541 1 1 


7029 


D-003328-07 


CGAAATCCCTGGTGCCAAA 


1351 


TFDP2 


NM 006286 


54541 1 1 


7029 


D-003328-08 


TGAGATCCATGATGACATA 


1352 


TP53 















TP53 


INIVI UUUD40 


8400737 


7157 


D-003329-05 


GAGGTTGGGTCTGACTGTA 


1353 


TP53 


MM nnnc;>iR 

INIVI UUU04D 


8400737 


7157 


D-003329-06 


CAGTCTACCTCCCGGCATA 


1354 


TP53 


MM r\nnczAR 

INIVI UUU04D 


8400737 


7157 


D-003329-07 


GCACAGAGGAAGAGAATCT 


1355 




NM 000546 


8400737 


7157 


D-003329-08 


GAAGAAACCAGTGGATGGA 


1356 


TP63 















Tpfi — 


INIVI KJKJoiZ^ 


31 54381 7 


8626 


D-003330-05 


CATCATGTCTG6ACTATTT 


1357 




INIVI UUO/^Z 


31543817 


8626 


D-003330-06 


CAAACAAGATTGAGATTAG 


1358 


TP63 


iNlVI UUO/ZZ- 


:31543817 


8626 


D-003330-07 


GCACACAGACAAATGAATT 


1359 


TP63 


MM HAITOO 
INIVl UUo/i£^ 


31543817 


8626 


D-003330-08 


CGACAGTCTTGTACAATTT 


1360 


TP73 














TP73 


NM 005427 


4885644 


7161 


D-003331-05 


GCAAGCAGGCCATCAAGGA 




TP73 


NM 005427- 


4885644 


7161 


D-003331-06 


GAGACGAGGACAGGTACTA 


1362 


TP73 


NM 005427 


4885644 


7161 


D-003331-07 


CTGCAGAACCTGAGCATTG 




TP73 


NM 005427 


4885644 


7161 


D-003331-08 


GGCCATGCCTGTTTACAAG 


1364 


YWHAZ 














YWHAZ 


NM 003406 


21735623 


7534 


D-003332-05 


GCAAGGAGCTGAATTATCC 


1365 


YWHAZ 
YWHAZ 


NM 003406 
NM 003406 


21735623 
21735623 


7534 
7534 


D-003332-06 


TAAGAGATATCTGCAATGA 


1366 


YWHAZ 


NM 003406 


21735623 


7534 


D-003332-07 
D-003332-08 


GACGGAAGGTGCTGAGAAA 
AGAGCAAAGTCTTCTATTT 


1367 
1368 



Table IX 



Gene 
Name 


Accession # 


Gl# 


Duplex # 


Sequence 


SEQ. ID 
NO. 


AR 


NM 000044 


21322251 


D-003400-01 


GGAACTCGATCGTATCATT 


1369 


AR 


NM 000044 


21322251 


D-003400-02 


CAAGGGAGGTTACACCAAA 


1370 


AR 


NM 000044 


21322251 


D-003400-03 


TCAAGGAACTCGATCGTAT 


1371 


AR 


NM 000044 


21322251 


D-003400-04 


GAAATGATTGCACTATTGA 


1372 














ESR1 


NM 000125 


4503602 


D-003401-01 


GAATGTGCCTGGCTAGAGA 


1373 


ESR1 


NM 000125 


4503602 


D-003401-02 


CATGAGAGCTGCCAACCTT 


1374 


ESR1 


NM 000125 


4503602 


D-003401-03 


AGAGAAAGATTGGCCAGTA 


1375 
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ESR1 I" 


vlM 000125 '■ 


503602 1 


3-003401 -04 


ilAAGGAGACTCGCTACTGT 


1376 














ESR2 


^IM 001437 


0835012 


D-003402-01 


3 AAC AT GT G CT C AAC AT G A 


1377 


ESR2 


*JM 001437 


083501 2 


D-003402-02 


3 CACGGCTCC AT ATACAT A 




ESR2 


001437 


0835012 


D-003402-03 






ESR2 


vlM 001437 


083501 2 




SGAAATGCGTAGAAGGAAT 


1380 














ESRRA 


004451 




8860919 


3-OO3403-01 




"1381 — 


ESRRA 


vlM 004451 


18860919 


5-003403-02 


rriAATfiPA CTCX riTriTPTP A 




ESRRA 


vlM 004451 


18860919 


3-003403-03 


r> n A TTri A f2 P PTPTPT A O AT 




ESRRA 


NM 004451 


18860919 


3-003403-04 


p p A p A p A p p r* A A A riTn; 

















ESRRB 


NM 004452 


22035686 


3-003404-01 


T A r^t^TC^ A C- C^TT t^r^ A A A 1 1 1 
1 AL.L» 1 laAoL. 1 1 AL>AAA 1 1 1 




ESRRB 


004452 


22035686 


3-003404-02 






ESRRB 


MM 004452 


22035686 


3-003404-03 


L/AAL. 1 L.L.OA 1 1 \j\tl\ 1 O 1 P\j 


- 


ESRRB 


NM 004452 


22035686 


D-003404-04 


nP APTPPPPAPPP ATI^TTT 















— — — — 


ESRRG 


NM 001438 


4503604 


D-003405-01 


AAACAAAGATGGACACATT 




ESRRG 


NM 001438 


4503604 


D-003405-02 


T/^ A/"^/"* A A A PXPX ATP ATrf^ A 

TOA(jt>AAAO 1 1> 1 A 1 w\ 1 t>A 


ifai — 


ESRRG 


NM 001438 


4503604 


D-003405-03 


GAAGACCAGTCCAAA I I AG 




ESRRG 


NM 001438 


4503604 


D-003405-04 


ATGAAGCGCTGCAGGA 1 1 A 


10Q0 

— 1392_ 














HNF4A 


NM 000457 


21361184 


D-003406-01 


^„ 

GGACATCAGTGGAGCATAT 


iOQO 

ionl" 


HNF4A 


NM 000457 


21361184 


D-003406-02 


GAAGGAAGCCGTCCAGAAT 


— ioq2 ' 


HNF4A 


NM 000457 


21361184 


D-003406-03 


CCAAGTACATCCCAGCT TT 


— 1392 — 


HNF4A 


NM 000457 


21361184 


D-003406-04 


GGACATGGGCGACTACAGT 


^^^^ 














HNF4G 


NM 004133 


6631 087 


D-003407-01 


GCACTGACATAAACGTTAA 


10Q7 

- 


HNF4G 


NM 004133 


6631 087 


D-003407-02 


ACAAAGAGATCCATGATGT 




HNF4G 


NM 004133 


6631087 


D-003407-03 


AGAGATCCATGATGTATAA 




HNF4G 


NM 004133 


6631087 


D-003407-04 


AAATGAACGTGAGAGAATA 
















HSAJ2425 


NM 017532 


8923776 


D-003408-01 


GAATGAATCTAGACC I I I G 


— ^dM — 
— TT^i — 


HSAJ2425 


NM 017532 


8923776 


D-003408-02 


GGAAATACGTGGAGACACT 


— 


HSAJ2425 


NM 017532 


8923776 


D-003408-03 


CCAGATAACTACGGGGATA 


^^^^ 


HSAJ2425 


NM 017532 


8923776 


D-003408-04 


TGGGGTAGCTTCTCATTGA 


— 1404 — 















NR0B1 


NM 000475 


5016089 


D-003409-01 


CAGCATGGATGATAT.GATG 


— TJr^ 
— ^nft 


NR0B1 


NM 000475 


5016089 


D-003409-02 


CTGCTGAGATTCATCAATG 


— 1406 


NR0B1 


NM 000475 


5016089 


D-003409-03 


ACAGATTCATCGAACTTAA 


— ^^nft 


NR0B1 


NM 000475 


5016089 


D-003409-04 


GAACGTGGCGCTCCTGTAC 


— 140§ — 














NR0B2 


NM 021969 


13259502 


D-003410-01 


GAATATGCCTGCCTGAAAG 


— TZnq~ 


NR0B2 


NM 021969 


1 3259502 


D-003410-02 


GGAATATGCCTGCCTGAAA 


— lAin 


NR0B2 


NM 021969 


1 3259502 


D-003410-03 


PPT A PPPPPT/iPPT AT/^XA 


1411 — 


NR0B2 


NM 021969 


13259502 


D-003410-04 


/-^ /-«/". ATTPTPX APP P APXTP 


1£! 














NR1D1 


NM 021724 


13430847 


D-00341 1-01 


CAACACAGGTGGCGTCATCTT 


— TZiT~ 


NR1D1 


NM 021724 


13430847 


D-0034 11-02 


GGGATGGTGTTACTGTGTA 1 1 


— 1414 


NR1D1 


NM 021724 


1 3430847 


D-00341 1-03 


CAACATGCATTCCGAGAAGTT 


1415 


NR1D1 


NM 021724 


13430847 


D-00341 1-04 


GCGCTTTGCTTCGTTGTTCTT 


1416 














NR1H2 


NM 007121 


1132162S 


D-00341 2-01 


GAACAGATCCGGAAGAAGA 


1417 


NR1H2 


NM 007121 


1132162? 


D-00341 2-02 


GAAGAACAGATCCGGAAGA 


1418 


NR1H2 


NM 007121 


1132162J 


) D-00341 2-02 


CTAAGCAAGTGCCTGGTTT 


1419 


NR1H2 INM 007121 


1132162( 


) D-00341 2-04 IGCTAACAGCGGCTCAAGAA 


1420 
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— 










NR1 H3 


NM 005693 


5031892 


D-0034 13-01 


GAACAGATCCGCCTGAAGA 


1421 


j^^^ 


NM 005693 


5031892 


D-0034 13-02 


GGAGATAGTTGACTTTGCT 


1422 




NM 005693 


5031892 


D-003413-03 


GAGT 1 [GCCTTGCTCATTG 


1423 


NR1 H3 


NM 005693 


5031892 


D-0034 13-04 


TGACTTTGCTAAACAGCTA 


1424 




— — 










j^P^^^^^ 


NM 005123 


4826979 


D-003414-01 


CAAGTGACCTCGACAACAA 


1425 


NR1 H4 


NM 005123 


4826979 


D-0034 14-02 


GAAAGAATTCGAAATAGTG 


1426 


— 


NM 005123 


4826979 


D-0034 14-03 


CAAGAGACTCTTCTACATT 


1427 


tlf 


NM 005123 


4826979 


D-O03414-04 


GAACCATACTCGCAATACA 


1428 





— 










"^Rilf 


NM 003889 


11863133 


D-0Q3415-01 


GAACCATGCTGACTTTGTA 


1429 


MRlH 


NM 003889 


11863133 


D-0034 15-02 


GATGGAGGCTCAGATGAAA 


1430 


-— III 


NM 003889 


11863133 


D-003415-03 


CAACCTACATGTTCAAAGG 


1431 


■ ■ 


NM 003889 


11863133 


D-003415-04 


CAGGAGCAATTCGGCATTA 


1432 





■— 










"^Riii — 


NM 005122 


4826660 


D-003416-01 


GGAAATCTGTCACATCGTA 


1433 


wiiil 


NM 005122 


4826660 


D-0034 16-02 


TCGCAGACATCAACACTTT 


1434 




NM 005122 


4826660 


D-003416-03 


CCTCTTCGCTACACAATTG 


1435 


MR1I 




NM 005122 


4826660 


D-003416-04 


GAACAGriTGTGCAGTTTA 


1.436 


— 












^^^^ 


NM 003297 


4507672 


D-003417-01 


TGACAGCACTTGATCATAA 


1437 


^^^^ 


NM 003297 


4507672 


D-003417-02 


GGAAGGAAGTGTACACCTA 


1438 


^^^^^ 


NM 003297 


4507672 


D-003417-03 


GAGCACATCTTGAAACTAC 


1439 


LJ — 


NM 003297 


4507672 


D-003417-04 


GAAGAAATTGCAGATCAAA 


1440 














NR2C2 


NM 003298 


4507674 


D-003418-01 


GAACAACGGTGACACTTCA 


1441 


]^^^^^ 


NM 003298 


4507674 


D-003418-02 


CTGATGAGCTCCAACATAA 


1442 


^2^^^ 


NM 003298 


4507674 


D-0034 18-03 


CAAGCTAAGTGAATCTTTG 


1443 


'^^^^^ 


NM 003298 


4507674 


D-003418-04 


GAAGACACCTACCGATTGG 


1444 














NR2E1 

NR2E1 


NM 003269 
NM 003269 


21361108 
21361^108 


D-003419-01 
D-003419-02 


GATCATATCTGAAATACAri 
CA Afi APTf^PTTTn A AT AT 


1445 


NR2E1 


NM 003269 


21361108 


D-003419-03 


wAwvjMV^ 1 1 1 1 L^AoA 1 A 1 
GTTAGATGCTACTGAATTT 


1446 
1447 


^^^^ 


NM 003269 


21361108 


D-003419-04 


CAATGTATCTCTATGAAGT 


1448 


— 












^^^^ 


NM 014249 


7657394 


D-003420-01 


GAGAAGCTCCTTTGTGATA 


1449 


^^^^^ 


NM 014249- 


7657394 


D-003420-02 


GAAGCACTATGGCATCXAT- 


1450 


^^^^•^ 


NM 014249 
NM 014249 


7657394 
7657394 


D-003420-03 
D-003420-04 


GAAGGATCCTGAGCACGTA 


1451 











GAAGCTCCTTTGTGATATG 


1452 


^onpl 


NM 005654 


20127484 


D-003421-01 


GAAACTCTCATGCGCGATA 


1453 


^Zp] 


NM 005654 


20127484 


D-003421-02 


TCTCATCCGCGATATGTTA 


1454 


nrHi 


NM 005654 


20127484 


D-003421-03 


CAAGAAGTGCCTCAAAGTG 


1455 





NM_005654 


20127484 


D-003421-04 


GGAACTTAACTTACACATG 


1456 





— 










[j^^^ 


MM 021005 


14149745 


D-003422-01 


GTACCTGTCGGGATATATT 


1457 


j^^p^ 


MM 021005 


14149745 


D-003422-02 


CCAACCAGCCGACGAGATT 


1458 




MM 021005 


14149745 


D-003422-03 


ACTCGTACCTGTCCGGATA 


1459 


NR2F2 


NM 021005 


14149745 


D-003422-04 


GGCCGTATATGGGAATTCA 


1460 














NR2F6 


NM 005234 


20070198 


D-003423-01 


CGACGCGTGTGGCCTCTCA 


1461 


NR2F6 


NM 005234 


20070198 


D-003423-02 


CAGCCGGTGTCCGAACTGA 


1462 


NR2F6 


NM 005234 


20070198 


D-003423-03 


CAACCGTGAGTGGCAGATC 


1463 


NR2F6 


MM 005234 


20070198 


D-003423-O4 


GTACTGCCGTCTCAAGAAG 
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NR3C1 


NM 000176 


4504132 


D-003424-01 


GAGGACAGATGTACCACTA 


1465 


NR3C1 


NM 000176 


4504132 


D-003424-02 


GATAAGACGATGAGTATTG 


1466 


NR3C1 


NM 000176 


4504132 


D-003424-03 


GAAGACGATTCATTCCTTT 


1467 


NR3C1 


NM 000176 


4504132 


D-003424-04 


GGACAGATGTACCACTATG 


1468 














NR3C2 


NM 000901 


4505198 


D-003425-01 


GGAAACAGATGATCCAAGT 


1469 


NR3C2 


NM 000901 


4505198 


D-003425-02 


CAGCTAAGATTTATCAGAA 


1470 


NR3C2 


NM 000901 


4505198 


D-003425-03 


GCACGAAAGTCAAAGAAGT 


1471 


NR3C2 


NM 000901 


4505198 


D-003425-04 


GGTATGCGGTCTTAGAATA 


1472 














NR4A1 


NM 002135 


21361341 


D-003426-01 


GAAGGAAGTTGTCCGAACA 


1473 


NR4A1 


NM 002135 


21361341 


D-003426-02 


CAGGAGAGTTTGACACCTT 


1474 


NR4A1 


NM 002135 


21361341 


D-003426-03 


GAGTGGCTCTGACTACTAT 


1475 


NR4A1 


NM 002135 


21361341 


D-003426-04 


GAAGGCCGGTGTGCT6TGT 


1476 














NR4A2 


NM 006186 


5453821 


D-003427-01 


GCAATGCGTTCGTGGCTTT 


1477 


NR4A2 


NM 006186 


5453821 


D-003427-02 


CGGCTACACAGGAGAGTTT 


1478 


NR4A2 


NM 006186 


5453821 


D-003427-03 


CCACGTGACTTTCAACAAT 


1479 


NR4A2 


NM 006186 


5453821 


D-003427-04 


GAATACAGCTCCGATTTGT 


1480 














NR4A3 


NM 006981 


11276070 


D-003428-01 


CAAAGAAGATCAGACATTA 


1481 


NR4A3 


NM 006981 


11276070 


D-003428-02 


GATCAGACATTACTTATTG 


1482 


NR4A3 


NM 006981 


11276070 


D-003428-03 


CCAGAGATCTTGATTATTC 


1483 


NR4A3 


NM 006981 


11276070 


D-003428-04 


GAAGTTGTCCGTACAGATA 


1484 














NR5A1 


NM 004959 


20070192 


D-003429-01 


GATTTGAAGTTCCTGAATA 


1485 


NR5A1 


NM 004959 


20070192 


D-00 3429-02 


GGAGCGAGCTGCTGGTGTT 


1486 


NR5A1 


NM 004959 


20070192 


D-003429-03 


GGAGGTGGCCGACCAGATG 


1487 


NR5A1 


NM 004959 


20070192 


D-OO3429-04 


CAAGGTGCCTGAGCTCATG 


1488 














NR5A2 


NM 003822 


20070161 


D-003430-01 


CCAAACATATGGCCACTTT 


1489 


NR5A2 


NM 003822 


20070161 


D-003430-02 


TCAGAGAACTTAAGGTTGA 


1490 


NR5A2 


NM 003822 


2007016-1-^ 


'D-003430-03 


GGATCCATCTTCCTGGTTA 


1491 


NR5A2 


NM 003822 


20070161 


D-003430-04 


AAGAATACCTCTACTACAA 


1492 














NR6A1 


NM 033334 


15451847 


D-003431-01 


CAACGAACCTGTCTCATTT 


1493 


NR6A1 


NM 033334 


15451847 


D-003431-02 


GAAGAACTACACAGATTTA 


1494 


NR6A1 


NM 033334 


15451847 


D-003431-03 


GAAGATGGATACGCTGJGA. 


1495 


NR6A1 


NM 033334 


15451847 


D-003431-04 


AAACGATACTGGTACATTr- 


1496 














null 


D16815 


2116671 


D-003432-01 


GAAGAATGATCGAATAGAT 


1497 


null . 


D16815 


2116671 


D-003432-02 


GAACATGGAGCAATATAAT 


1498 


null 


D16815 


2116671 


D-003432-03 


GAGGAGCTCTTGGCCTTTA 


1499 


null 


D16815 


2116671 


D-003432-04 


TAAACAACATGCACTCTGA 


1500 














PGR 


NM 000926 


4505766 




GAGATGAGGTCAAGCTACA 


1501 


PGR 


NM 000926 


4505766 


D-003433-02 


CAGCGTTTCTATCAACTTA 


1502 


PGR 


NM 000926 


4505766 


D-003433-03 


AGATAACTGTCATTCAGTA 


1503 


PGR 


NM 000926 


4505766 


D-003433-04 


GTAGTGAAGTGGTCTAAAT 


1504 














PPARA 


NM 005036 


7549810 


D-003434-01 


TCACGGAGCTCACGGAATT 


1505 


PPARA 
PPARA 


NM. 005036 
NM 005036 


7549810 


D-003434-02 


GAACATGACATAGAAGATT 


1506 


PPARA 


NM 005036 


7549810 
7549810 


D-003434-03 
D-003434-04 


GGATAGTTCTGGAAGCTTT 
GACTCAAGCTGGTGTATGA 


1507 
1508 














PPARD 


NM 006238 


5453939 


D-003435-01 


GAGCGCAGCTGCAAGATTC 


1509 
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NM 006238 


5453939 


D-003435-02 


GCATGAAGCTGGAGTACGA 


1510 


PPAPn 

pp^*^^ 


NM 006238 


5453939 


D-003435-03 


GGAAGCAGTTGGTGAATGG 


1511 


'^'^^'^^ 


NM 006238 


5453939 


D-003435-04 


GCTGCAAGATTCAGAAGAA 


1512 


— 












PPARG 


NM 138712 


20336234 


D-003436-01 


AGACTCAGCTCTACAATAA 


1513 


PPARG 


NM 138712 


20336234 


D-003436-02 


GATTGAAGCTTATCTATGA 


1514 


PPARG 


NM 138712 


20336234 


D-003436-03 


AAGTAACTCTCCTCAAATA 


1515 


PPARG 


NM 138712 


20336234 


D-003436-04 


GCAI 1 ICTACTCCACATTA 


1516 


— 












^P 


NM 000964 


4506418 


D-003437-01 


GACAAGAACTGCATCATCA 


1517 


RARA 


NM 000964 


4506418 


D-003437-02 


GCAAATACACTACGAACAA 


1518 


RARA 


NM 000964 


4506418 


D-003437-03 


GAACAACAGCTCAGAACAA 


1519 


RARA 


NM 000964 


4506418 


D-003437-04 


GAGCAGCAGTTCTGAAGAG 


1520 














RARB 


NM 000965 


14916493 


D-003438-01 


GCACACTGCTCAATCAATT 


1521 


RARB 


NM 000965 


14916493 


D-003438-02 


GCAGAAGTATTCAGAAGAA 


1522 


RARB 


NM 000965 


14916493 


D-003438-03 


GGAATGACAGGAACAAGAA 


1523 


RARB 


NM 000965 


14916493 


D-003438-04 


GCACAGTCCTAGCATCTCA 


1524 














RARG 


NM 000966 


21359851 


D-003439-01 


GAAATGACCGGAACAAGAA 


1525 


RARG 


NM 000966 


21359851 


D-003439-02 


TAGAAGAGCTCATCACCAA 


1526 


RARG 


NM 000966 


21359851 


D-003439-03 


CAAGGAAGCTGTGCGAAAT 


1527 


RARG 


NM 000966 


21359851 


D-003439-04 


TCAGTGAGCTGGCTACCAA 


1528 














RORA 


NM 134261 


19743902 


D-003440-01 


GGAAAGAGTTTATGTTCTA 


1529 


RORA 


NM 134261 


19743902 


D-003440-02 


CAAGATCTGTGGAGACAAA 


1530 


RORA 


NM 134261 


19743902 


D-003440-03 


GCACCTGAGTGAAGATGAA 


1531 


RORA 


NM 134261 


19743902 


D-003440-04 


CCGAGAAGATGGAATACTA 


1532 














RORB 


NM 006914 


19743906 


D-003441-01 


GCACAGAACATCATTAAGT 


1533 


RORB 


NM 006914 


19743906 


D-003441-02 


CCACACCTATGAAGAAATT 


1534 


RORB 


NM 006914 


19743906 


D-003441-03 


GATCAAATTCTACTTCTGA 


1535 


RORB 


NM 006914 


19743906 


O-J1D03441-04 


TCAAACAGATAAAGCAAGA 


1536 














RORC 


NM 005060 


19743908 


D-003442-01 


TAGAACAGCTGCAGTACAA 


1537 


RORC 


NM 005060 


19743908 


D-003442-02 


TCACCGAGGCCATTCAGTA 


1538 


RORC 


NM 005060 


19743908 


D-003442-03 


GAACAGCTGCAGTACAATC 


1539 


RORC 


NM 005060 


19743908 


D-003442-04 


CCTCATGCCACCTTGAATA . 


.1540 










,- 




RXRA 


NM 002957 


21536318 


D-003443-01 


TGACGGAGCTTGTGTCCAA 


1541 


RXRA 


NM 002957 


21536318 


D-003443-02 


CAACAAGGACTGCCTGATT 


1542 


RXRA 


NM 002957 


21536318 


D-003443-03 


GCAAGGACCTGACCTACAG 


1543 


RXRA 


NM 002957 


21536318 


D-003443-04 


GCAAGGACCGGAACGAGAA 


1544 














RXRB 


NM 021976 


21687229 


D-003444-01 


GCAAAGACCTTACATACTC 


1545 


RXRB 


NM 021976 


21687229 


D-003444-02 


GCAATCATTCTGTTTAATC 


1546 


RXRB 


NM 021976 


21687229 


D-003444-03 


TCACACCGATCCATTGATG 


1547 


RXRB 


NM 021976 


21687229 


D-003444-04 


GCAAACGGCTATGTGCAAT 


1548 














RXRG 


NM 006917 


21361386 


n f\n*\AAR Hi 

U-U UOH'f O-U 1 


GGAAGGACCTCATCTACAC 


1549 


RXRG 


NM 006917 


21361386 


D-O03445-O2 


CCGGATCTCTGGTTAAACA 


1550 


RXRG 


NM 006917 


21361386 


D-003445-O3 


GCGAGCCATTGTACTCTTT 


1551 


RXRG 


NM 006917 


21361386 


D-003445-04 


GAGCCATTGTACTCTTTAA 


1552 














THRA 


NM 003250 


20127451 


D-003446-G1 


GGACAAAGACGAGCAGTGT 


1553 


THRA 


NM 003250 


20127451 


D-003446-02 


GGAAACAGAGGCGGAAATT 


1554 
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THRA 


NM 003250 


20127451 


D-003446-03 


GTAAGCTGATTGAGCAGAA 


1555 


THRA 


NM 003250 


20127451 


D-003446-04 


GAACCTCCATCCCACCTAT 


1556 














THRB 


NM 000461 


10835122 


D-003447-01 


GAATGTCGCTTTAAGAAAT 


1557 


THRB 


NM 000461 


10835122 


D-003447-02 


GAACAGTCGTCGCCACATC 


1558 




NM 000461 


10835122 


D-003447-03 


GGACAAGCACCAATAGTCA 


1559 


THRB 


NM 000461 


10835122 


D-003447-04 


GTGGAAAGGTTGACTTGGA 


1560 














VDR 


NM 000376 


4507882 


D-003448-01 


TGAAGAAGCTGAACTTGCA 


1561 


VDR 


NM 000376 


4507882 


D-003448-02 


GCAACCAAGACTACAAGTA 


1562 


VDR 


NM 000376 


4507882 


D-003448-03 


TCAATGCTATGAGCTGTGA 


1563 


VDR 


NM 000376 


4507882 


D-003448-04 


CCATTGAGGTCATCATGTT 


1564 



Gene 

Symbol 

ABCB1 





; SECg;IDNO.-> 


GACCAUAAAUGUAAGGUUU 


1565 


UAGAAGAUCUGAUGUCAAA 


1566 


GAAAUGUUCACUUCAGUUA 


1567 


GAAGAUCGCUACUGAAGCA 


1568 




'A Sense ' 




GGAAGCAACUGCAGAGACA 


1569 


GAUGACACCUCUGAACAAA 


1570 


UAAAGUUGCUGAUCAAGUU 


1571 


CAAGGAGUCUGCCGAAGGA 


1572 




Sense J- s ' :4s 


GC AGAUGCCUUCUUCGUUA 


1573 


AGGCAAAUCUUCGUUAUUA 


1574 


GGGAAGAAAUCUGGUCUAA 


1575 


UGACUCAUCCCAACAUUUA 


1576 








CCGACGUGCUGCCUGAGUA 


1577 


GAGAAGAGCAGCGACAGUU 


1578 


GAUCAUAGCACCUAAGAUA 


1579 


GCUAUUUACUGCUCUUAUU 


1580 


UCACUGGGCUCGUUUAAUU 


1581 


GUGCGAGCCUUCUGAAUAU 


1582 


GCUAAGCUAUACUACUGUA 


1583 


UGACGGCGCUCUACUUCAC 


1584 




Sense . ? ; 




GAGAUGAAUUCCUUUGAAA 


1585 


GAAGAACGCAUGAAACGAA 


1586 


GAUAAAGACACGAUUGAAA 


1587 


GCUGAGAGGUCUAUUUAAA 


1588 



Sense 




GAACAACAAUGGCUAUGAA 


1589 


GUACAUACCUGGCUGGAUU 


1590 
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GAACAGCUCACAAGUAUAU 


1591 


GGAAACGUGUGUCUAUAUU 


1592 




- Sense 




GGAGGUGGGAGGACAGUUA 


1593 


UCACAGCCCUGGUGGAUGA 


1594 


GAAGCUGGCUCCUAUGUUC 


1595 


GGUCAACACUACGAACAUG 


1596 




• ■ ■ Sense r-.: ■ 


r. 


GAACACAAGGUCAACAUUG 


1597 


AGAAGGAGCUGGCCUAGUG 


1598 


CGGAAACUCUUCACAUUUG 


1599 


CAACAAAUUUGACAACAAC 


1600 



SLC21A2 


Sense v-v. 






GUACAUCUCCAUCUUAUUU 


1601 




GGAAGUGGCUGAGUUAUUA 


1602 




GAAGGGAGGCUCAAUGUAA 


1603 




GAAGGAAGUGGCUGAGUUA 


1604 



V ' '' ■■■ Sense S^v'. . 


m:-' .m 


GUAGAAACAGGAGCUAUUA 


1605 


CAAGAUUACUGUCAAACAA 


1606 


GCACAAGA6UAUUUG6UAA 


1607 


GCAAAUGUCCCUUCUGUAU 


1608 


GCAUGACUCCUAUAUAAUA 


1609 


AAACAGCAAUUUCCCUUAA 


1610 


GAAAAUGGCUCUUCAGGAA 


1611 





■ • 


GUUCAUCGCUCUCCUCUUU 


- 1612 


GGAUCAAGCUGUUUCUGAA . -V 


1613 


GGACUGCAGUUUGUACUUG 


1614 


GAGUGAAACUGACCUAUGG 


1615 




sr ' '■ Sen^y.-. v^'^l- 




GAAGGCUGCUCCCGUGGAA 


1616 


GAAAGCCACUCUAUGAAAG 


1617 


GAAACCAGGUGCCUUCAGA 


1618 


CCUCACAGCUGUAUUCAUG 


1619 




.Sense?;- - ;;■ 




CCAGGGAGCUGCUGGUCAU 


1620 


GGGUUGACAUCUUAUUUGA 


1621 


GCAGGAGGGUCUCUGUGUU 


1622 


GGCCAUCGCCUACUGAUUG 


1623 


CAACACCCAUGGCAAUUAA 


1624 


GAGGAAAGAUCUUGCUGAU 


1625 


GAGCAAGCGUCCUCCAAAU 


1626 


GCAACACCCAUGGCAAUUA 


1627 




Sense ' .-1" ' 




CCAAAGAACUCAAUGAACA 


1628 


ACAAGAACCUUCAGACUAA 


1629 


GAAGGUAGAUAGAAGAAUG 


1630 
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I GUAUUGAACUGUACUGUAA 



SLC4A4 [ 



XPNPE 
PI 



Sense 




GCAAUUCUCUUCAUUUAUC 


1632 


GGAAAGAUGUCCAGUGAAA 


1633 


GGACAAAGCCUUCUUGAAU 


1634 


GGAAUGGGAUCCAGCAAUU 


1635 




Sense ; 




UGAAAGCCAUUGACAUUUG 


1636 


CAGACACGCUGGAGUUUAA 


1637 


CAAUAGCGCUUUCUGGUUU 


1638 


GCAGGUAGCAGAUGGACUA 


1639 




'- Selnse 




UCAGAGUGCUGUCUUAUGU 


1640 


CAACUUGUUUGACGACGAA 


1641 


UGACAGAGCCUGCUGAUAC 


1642 


AGGCGGCUCUGUACCAUUU 


1643 








GAAACAUGGUGUGAUAUUG 


1644 


GCAGAUGUUUCCUUAUAUA 


1645 


CAACAGAGAUGCCAUGAUA 


1646 
"1647 


GAAAGGGGCUACAUUGAGA 




■ ■■-'ISense 




GACGUGAGCUUCCCAACAA 


1648 


GCGACUGGCUCAACAAUUA 


1649 


; GAGAUUGCGUGGCUAUUUA 


1650 


[ GACAGCAAGUGGACACUUA 


1651 






~"- Sense 




GGAAGAGACUCGUGCAAUG 


1652 


GGAACGAUGUGGCAAGUUG 


1653 


GAAGUAAGUCCUGAUUGAA 


1654 


GAACUGCUAUAGAUUUGUG ~* 


1655 








CAUAGAAGCUUUACCAAGA 


1656 


GAAUGGAGGAUGAAGAUUA 


1657 


GGUCAAUGCUCUAAGUGAA 


1658 


GAGAGGAGUUCUAUGAAUG 


1659 




: ■ Sense 


• 


CAUGAUGGCUUCUGAAUAA 


1660 


CAAGAGCUCAUGAGGAUUU 


1661 


GCAACAACUUCUCUUAUGU 


1662 


GGUCUUCGCUAGGAUCUGA 


1663 




. Sense 




GGACCUAGCUGUUGUAAAG 


1664 


GAAAGACCAUGCAUUUAAA 


1665 


GCAGGAAGAUGUAUGCUUU 


1666 


GAAUAAACCAGUAUCGUUA 


1667 
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Sensie 




GGACCCAGAUAUCCAAAUA 


1668 


GCAAUACUAUCCUAACUGA 
GAAUAUAGAUUAGCGUUUG 


1669 1 
1670 


GAUGAGGCAUGACUACUAA 


1671 ! 



; ; Sen^e. 




GCGAGUCUGUCCACUAUAC 


1672 


GAGAACGGGCACCCACUGA 


1673 


GAACGGCCACCCACUGAUG 


1674 


GGUCAAUGCUGCUGUGUAC 


1675 




Sense 




UCCAGGAACACUAUAAUUA 


1676 


GUGAGGAUCUUGUCCAUCA 


1677 


CAUGCUCUGUUGCGCCAUU 


1678 


CCAACAAGGUCCAGGAACA 


1679 




• - . Sense E. ^ " > 




ACACUGAUACUGUCGAUGA 


1680 


AAUAGGAGCAACACUGAUA 


1681 


CAGCAGGAGUUACCUUGUU 


1682 


GGACACCCAUGAAGCUAAU 


1683 




i.- -Sense- 




GCACAGAACUCCACUCAAA 


1684 


GGACAGCAGUUCAUUGUUA 


1685 


GAGAAGAGGGUAUGUUUAA 


1686 


GGACAAACUUCGACCCUUU 


1687 




-^r -sv, , Sense " ; 




UCGCUACCCUGCUGUUAUU 


:■' 1688 


GCOACAAGAACUACCGAUA 


1689 


CAAGAAAGCUUCCUUCAAC 


1690 


GAGAAGAAAUACAGUCAAU 


1691 




Sense --^ r^- |;. '^^ - ?'>i 


CAAUAUAUCUGAAGAGCUA | 1692 


GAACUGGACUGUGGCAUUG ! 1693 


GUGAGAAGAUGGUAUAUUU i 1694 


GAGGGUACUUUAAGACAUA j 1695 




«v : • - . V Sense - r^:- ■ \ ■ 


CAUGAGGUGUGAACUGUUA 


1696 


GAAGUGAAAUGCUUUAAUG 


1697 


AAAUAUGGCUCCUCCUUAG 


1698 


GCAAUCACAUUUAUGCAUA 


1699 


CAAGAUAACUGAGGUGGAU 


1700 


CAUGGUACAUUCAAGAUUU 


1701 






' Sense' ■ , 




GAACUCUACUUCAGUCAAU 


1704 


GGGCAAAUGCAUCAUAAUA 


1703 


GAAGAGAGGGAGUUUAAUA 


1704 


GAACAAAGCCACUGACUGA 


1705 
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•' ' — Ssnse. — ■ •■ " — _^ 


' ' 


GAAGUGAACUAUGAAGUAA 


1706 


CAACAAG G AUGACAAGAAA 


, 1707 

1708 ^ 


GGACAAAGUUUACCAAAUG 


GAGGGUCGAUCAUCUAUUA 


1709 

1710 
1711 


GAAUAUAGAGGGCUUAUGA 


CAACGACUAUGAAGAAUUC 


GAAGUGAGCAGAUCAGAAU 


1712 


GAGGAAAUCUCCAAAUGCA 


1713 




1 .^'^ i. J ; ■ Sense • : ■ 




CCAGGCAGCUGAUCAUAGA 


1714 


UCUCAGGUGUUGCCAAAUA 


1715 


GAACAGCUGUAAUCUAUGA 


1716 


CCACUGGUCUGUAGGGAUU 


1717 




? -t- -l -t -Sense ■ a?- % r 




UCGUAAAGCUGUUGAUAUC 


1718 


GAGGAGAUCUUUGAUGACA 


1719 


GUAAAGCUGUUGAUAUCGA 


1720 


GAUCGUAAAGCUGUUGAUA 


1721 




. * Sense:. ■/ ^- '*V;. 




AGACGAAGGUGAUUUACCA 


1722 


UGUGAGAGCUACCUAGUCA 


1723 


GAAGAAAUUUCAGAUGACA , 


1724 


UAAUAGGCAUUUCCUCUUU 


1725 




■ 't :: Sense ^- ' 




GUGAAGAUCUUGACCAAUG 


1726 


GAUCAGCAUACACAAAUUA 


1727 


UAAUGAAGCUUCUGCAACA 


1728 


GGCGCUAUGUGUAUUAUUA 


1729 




. : vi.- •: , *Sense -i^i. ;k - " ■ ■ ■'\. %. , 


GUACAAAGCUGGUAUAUCG .| 1730 


GAAAGACUCCCAGUGUAUA 1 1731 


GGAAGUCCAUGUCAUCGAA ]> 1732 


GCAAAGACAUGACGACGUU | 1733 




^ • ?s it -A; % SeRse^i . ■ V 




GUAAAGAGGAGACUGAAUG 


1734 


GGUCUGUGAUGGUCCCUAA 


1735 


CAAAGAUGCCUACGACAUG 


1736 


GGGCGAUGCCUGAGGGUUA 


1737 




Sense • ' 




UCACACAAGUUUAUGGUUU 


1738 


CAACAGCCGUGACCACIJIJU 


1739 


UAACGAAGCUGCAAUCAUG 


1740 


GAACUUGACGAUACUCUAA 


1741 




Sense 




GAAGAGAGGUCGUUCUAAG 


1742 


AAGCAGAUGUGCAl iRAI II lA 


1743 
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GUAAAGGGCCCUCUAAUAA j 1745 | 


■ • : ■ Sense | 


GAAAGAAUAUGCGCAAGUU 


1746 


GAAGAAGAAUGGCAGAAUG 


1747 


GCAGGAAGAUUAUGAUGUG 


1748 


AAAGAAAGCCCAUGUACUA 


1749 




'4- Sense 




GGGAAAGAGCUGCAUAUUA 


1750 


GCUUAGAACUCUUUACUGA 


1751 


UAUAAGAGCUUGACCAAUG 


1752 


GCAGAUAUCUCUAUGAUUG 


1753 1 




Sense v ' -7 


■■ J . 


CAACUCAUGUCCAAUACUG 


1754 


GGAAUGAGCCAGAUGUUUA 


1755 


GGAGACAGCUGUACGUUGU 


1756 ' 


UCGAAGAGGUGACAACUGA 


1757 




■ Wi- Sensaa?^^^;. .■ ^-M- ■ . • ".jj^v ■. 


GUAAGGAGAAGCAAUAUAA 1758 


AAGAUGAGCUUUCCAUGUA 1759 


GGACAUCUAUCUCAUCAUU 1 1760 


GACAAGAGCCCACAGAGUA j 1761 




Sense 




GUACUUCCCUCAGGCCUAU 


1762 ! 


GCUGUGCCUUGACUACGUA 


1763 


GUACUUCCCUCAGGCCUAU 


1764 


GGUCAGGUGCCUCGAGAUC 


1765 




^ %x -^ilSense '■ YXn- '' ' ■: 




CAGCGUAACUUCAUUCUUC 


1766 


UAACUUCAUUCUUCAGGUA 


1767 


CAGCUGCUCUUACCCAUUU 


1768 


GAUUGAAGCUAUUACUGAA 


1769 


UAGAAGAGCUCCGUCAGAA 


- 1770 


CCACAUAUGCGUUGAUUGA 


1771 


GGGCAGGGCUCUCUACCUA 


1772 




Sense ■ -fv^ 




GAACAGCCUUCAAAUCAAA 


1773 


GAUGUUCUCUACUAUGUUA 


1774 


GCAAAUACUGGAAGGAUUA 


1775 


GAGGAAAGCUCGUAAUUUA 


1776 




Sense 




CCACACGGCUGACCUCAUA 


1777 


CAGCAGAAUUCCUCUUAUA 


1778 


GCAGAAUUCCUCUUAUAAA 


1779 


GAUCGGGAUUUAUUUCUAU 


1780 




•Sense 




UGUGGGAGCUGAUGACUUU 


1781 


UCACAGAGAUGUUGAAAGG 


1782 
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UGGAAG AG AUCACAGGUUA 
I GCUCAUCGCUCACAACCAA 



;Sense " ; • ' 




GAGCACCACUUCUAUGAUU 


1785 


CAAAG AAGCU G U U CAAU AU 


1786 


UGAAAGCCCUCCCUUAUUA 


1787 


GAAAUAGCAUGGAGAAGGA 


1788 



' Sense 




CCAUACAGCUUCAUAAAUA 


1789 


GAAGAGAACUUAUCUAGUG 


1790 


GAAGUGGGCUCCAGUAUUA 


1791 


GCAAGAUGCUGAUUCAUUA 


1792 


GAAGUGGGCUCCAGUAUUA 


1793 


GAACGGACACUGAAAUAUU 


1794 


GCAGAUAGUUCUACCAGUA 


1795 




-v''^ ' Sense ''v ' . •' 




GAACAAGGAG.UCAGACAUU 


1796 


AAACUAGGGGGUUGAAUGA 


1797 


GAUGGAACUUCGACUUUGU 


1798 


GUAAACAGAUGGCACUUUG 


1799 








GGAAUGGACAUCCUGUAUA 


1800 


GGAGAAAGAUGUCAAACGU 


1801 


GAAUGGACAUCCUGUAUAA 


1802 


GUAAACAGCUCGAAUUAAG 


1803 








1 CAGAUAGGCUCCGAAGAUG 


1804 


AGACUCAGCUCCAGAAUAC 


1805 - 


GAUCGGUUCUUUCAUCUUC 


1806 


CAGGAUCGGUUCUUUCAUC 


1807 




■ Sense ' ' M-^ ti 




CCAGAUAAGUGACAUAAGU 


1808 


UAAGUGACCUGCUUUGUAA 


1809 


GAAGAGAGAUUCCAUUGAA 


1810 


UGAAAGAGCUCAACAACAA 


1811 




i Sense ^ ,: ' i;. 




GAGCAUGCCGUCAAUAAUA 


1812 


CAAGAGAACUCAUCAUAAG 


1813 


CCAAAGUGCUGCAGUACUA 


1814 


UAAGAAAGCUAUCCAGAUU 


1815 




Sense 




GAAACAAGCCCUUUAAGUU 


1816 


GAAUUACUCUCCAAGUUGA 


1817 


CAGCACAAGUGGUAGGUUA 


1818 


GUUGAGGACUGAAUUACUC 


1819 


GAUGGCAGCUGUUUAAAUG 


1820 


GAGUAUCCUUUCAGUGUUA 


1821 
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1A 



■' • V' ? ;■ .Sense ■ 




CAAAGGAACCUACUUGUAC 


1822 


GGAACCUACUUGUACAAUG 


1823 


GAACCUACUUGUACAAUGA 


1824 


GAGUGUGUCUCCUGUAGUA 


1825 




^;v Sense 




GGACAAGAAUCAAUGGAUA 


1826 


GAACAAGCCUGCAGGAUUC 


1827 


GGACUUGUGUGCCCUUAUA 


1828 


GAACACAAAGGCACUAUAA 


1829 




.' / -t. - ;^ Sense -1- « 




CGAAGAAAGUGAUGAAUUU 


1830 


GCUCUUUGCGCAUCUCUUU 


1831 


UGAAAGACCUGGUGGAAGA 


1832 


GCAAUUCAGUUUCUACAUC 


1833 




-i. f" Stense 




GAAGACAGAG UU AU UAAAC 


1834 


UCACGAAGACAGAGUUAUU 


1835 


AGACAGAGUUAUUAAACCA 


1836 


CACGAAGACAGAGUUAUUA 


1837 


GCUGAAGCCUGUCUGAUGU 


1838 




■ ''> ■■■ "ISense v;; -Sa 




CAAAUGAUCUGAGGCAGUU 


1839 


GUUCAUAGUUUGAGCGUUA 


1840 


GGAGAAACCUGUUGUGAUU 


1841 


GGACAAAGUUGCUGAAAUC 


1842 


CAAAUGAUCUGAGGCAGUU 


1843 


GGAGAAACCUGUUGUGAUU 


1844 


GGACAAAGUUGCUGAAAUC 


1845 


GUUCAUAGUUUGAGCGUUA 


1846 




1 • "Sense' h - ' ■" 




UGAAGCACCUUGAUCUUUG 


1847 - 


GGGCAGCGCAUACCUGUUU 


1848 


GAGGAGCGCUGUUUGAGUU 


1849 


GGACGAGGAGCGCUGUUUG 


1850 


GAGGAGCGCUGUUUGAGUU 


1851 


GGAUGUCUCUCUCCUCUUU 


1852 


GCUCACUCCUUUCUACUAA 


1853 


UGAAGCACCUUGAUCUUUG 


1854 




Sensei; 




GCACAGAUAUUUCCAUUUC 


1855 


GCAGUCCUCUUAUUCCUAA 


1856 


GAACUCAAGCUGCGUUUAU 


1857 


GGACGAAUUGAGAUAAUAU 


1858 



Sensed .: . - | 


UAAGAACACUGCUCAUGAA ! 1859 


GAGGCAUCCUGAAGCAUUA 


1 1860 


GAAGGCGGCUGCAGAACUG 


1 1861 
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Sense | | 


CUAUCGAGGUCGUUAAAUU 


1863 


1 


GAAUGCAGCUGGAAGAUCU 


1864 


■-i 


GCGGCGAGCUGGACUGUUU 


1 1865 




CCAGACCGAUGUGUAUUUA 


t 1866 





:?Sense 




iGGUCUCACCUCGCUAUGGU 


1867 


GAAAGCGAAUUCCUAGACA 


1868 


GCACAAACAAGAGUGAUGU 


1869 


GAAGGGCACCUCAGAAACA 


1870 


UCUGCCAACUUGUAUUAAA 


1871 




l '■ ' .M '- ^ense 'i -: yS' . 4 




UCAAGUGUCUUCCAUCAUG 


1872 


UCAAGUGCCUUAAUAGUAG 


1873 


GGAGUACCCUGAGGCUAUA 


1874 


GAUGAGAUCUUCCUACUGU 


1875 




, . ;- ■ s -'Sense: - 'H - 




GAGCUGGGCUAAGUAAAUA 


1876 


GACCAAAGAUGGAGUGAGA 


1877 


GGAAGAAACUGGUGAUUGU 


1878 


GGCUGUAACUACUUUAUAA 


1879 




. Sense 




GGACAUUUGUUUGCCAUUU 


1880 


GGAGAACCAUAUACUCUUG 


1881 


GAACCAAUGCUUUCUCAUG 


1882 


GAAGACCUGUUAUGUAGAG 


1883 


GAUCAAGAAUUGCAAUAUC 


1884 


GAAAAGGGGUGACCUAGUA 


1885 


UGACAAACCUUAUGGAAAA 


1886 




Sense v ^; . 




GGAAUGAGCUUCAGAUGCA 


1887 


GGACACAGCUGUAAGAUUG 


1888 


GAGAAGAGAUUACAGAUAA 


1889 


GAAGAAACAUUCGCUAUUC 


1890 




' " Sense 




GAGGGUGGUUUAUGAUUAA 


1891 


CAACAAAG AAC AAU CAC UA 


1892 


GAAGAAAU AU ACACG G U U U 


1893 


UACAUGAGCUUUACAGAUA 


1894 




Sense 




GGUAGGAGAUGAAUUGUUU 


1895 


AGAAGGAACUGAUCAUUAA 


1896 


CUACAGACCUCCAAUAUCA 


1897 


GAAACUGGCCAAACGGUUA 


1898 
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fGAliUAUCGCUGCAAAGGAA 
fGAiSAGUGCCUGCAAGCUUU 



:d 



GACAAGAGGUGGCCAUAAA 


1901 


UUAAAUCGCUGUCUUGAGA 


1902 


..._J 






. :>^ Sense ' ' - > . I • - . ■ 


ACUAAGAGGUGAACAUGUA 


1903 




GAUCAUGAAUGUCCGAAGA 


1904 




GAUGAGACCCUAGUACUGA 


1905 




CAGCAAAGGUGCCAAAGAU 


1906 






, j -'^' Sense .' | " 


UAAAGGCAGUUGUCCACUA 


1907 




GAAGGGACCUGCUUUCUUG 


1908 


GCAAAGACGUCCCUAAGAG 


1909 


ccaaugggguggcugcAaa 


1910 







gagcacggcuuuaauaagu 


1911 


caaacuccguuaugauaua 


1912 


ggauaaaguugucugauuu 


1913 


ggaaaugccuccauaaaua 


1914 



'f Sense AkI- "/<^ 


%c - ' . A'i: 


ggacauggcugugaauugg 


1915 


agaaagaagucaccgaaga 


1916 


ggagaaggccacccaauga 


1917 


ccacagcgaugacuacauu 


1918 



M : ■ Glf jSense ^: 1 ' t. 


gcuguuaaauuauggcuua 


1919 


gcaaagaaagcuagaauug 


1920 


caucagagagucuuauaua 


1921 


CGAAUGAGUUGCCAUAUAA 


1922 



"Sefise-- - .'fir ; 




GGCCAUAGCUUAAUUAAUC 


1923 


GCACAGCCGUU.UACCAUGA 


1924 


GGACAGCCGUUUAGUCAAG 


1925 


GAACUGAUUCCUGAAAUAA 


1926 



Sense • . . ''-;-svi 




CACAUAAACUCGUAUCUUA 


1927 


GAGAAGGCACUCAAGUUUA 


1928 


AGACAUAGUUUACAGAGAA 


1929 


GCAGAGAGAUUUCAUAACU 


1930 




: Sense- 




GAACAUACCUGUAAAUCAU 


1931 


GGACAUCGACCUGUUAUUA 


1932 


UAAAUGAACUCCUACAUAG 


1933 


GGAAUUAUCUCUGAACAUA 


1934 




Sense ' 




GOAGAAAUACAUCUACUAA 


1935 


GCUAUGAGCUGUUCUUCAA 


1936 



wo 2004/045543 



PCT/US2003/036787 



LggACAAAGCU CUACUCAUC 



IGCUCAACCCUGGAGACAUU 



J, .- Sense:. ' 




GGACAAGACUCGCAGAUUU 


1939 


GCUGAGCGCUUUGCAAUAA 


1940 


CAAGAAGUCUCACGGGAAA 


1941 


UCACAGAGGCCAAGAAAUU 


1942 




■.V' /Sense'' 




UGGAAGCCAUCGCCAAAUA 


1942 


CAUCAGUGCAUGACGUUUA 


1943 


UGAAUGAGCUGGUGGAUUA 


1944 


UGCCAAAACUUACCUAUAA 


1945 




■ W:- ■ %l Sense "X W 




GAGCUGCACUCCAAUGAGA 


1946 


GAAACCAAGCCAUUAAUGA 


1947 


CCAAGGAGCUACUGACAUU 


1948 


AGAGAAACAUGGCCCAAUA - 


1949 




' ?- 'b^i. ' Sense r. ■ • • 


- A: / * 


CCAGAGACAUUUACAUUAA 


1950 


GAAGGGAGUUUGCUAAAUU 


1951 


GAACAGAUCUGAUGAAUGA 


1952 


CAAGAGAGCUGAAGACUAU 


1953 




;!:;Sensie; - ■ '■ ■ 




GCAUAUAUAUUCAGCAUUG 


1954 


CAACUUGACUGCAGUAUUG 


1955 


GAACUUAACUUUCCAUGUU 


1956 


GACAAGACCUGUAGUAAUU 


1957 




• Is. Sense 




AGAAAGAGCUUGACAGUAA 


1958 


GGAAGUAGUUCACAAAAUA 


1959 


UGAAGUAUCUGUAUCCAAA 


1960 


GAGCUUCACUCCCUUAGUU 


1961 




•'SSensei'S;;, 




UAAGGACUCUGAAGAUGUA 


1962 


GAGAAAGUGUGUAAUUAUG 


1963 


GCUCAGGACUUAGCAAGAA 


1964 


GAAACUGAAUACCUAAGAU 


1965 


GAAAGUGAAUACCUAAGAU 


1966 


UAAGGACUCUGAAG AUG U A 


1967 


GAGAAAGUGUGUAAUUAUG 


1968 


GCUCAGGACUUAGCAAGAA 


1969 




Sense ■ 




CCAUCCAGCUGAUCCAGAA 


1970 


GAACCCUCCUGAUGAGAGU 


1971 


GAGGACAUCCACCAGUACA 


1972 




Sense I 


!GAUUAGAGACCAAGGAUUU I 
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CCACUGAUGUGUGUUAAUU 


1973 


CAAUAGAACCUGUCAAUAU 


1974 


GAAGACAGGAAUCGAAUGA 


1975 




. ' " Sense .. | ' ■ 


GAUGUGAGUAGAAGAGUUA | 1975 


GGAAGAAUUUGUACCAUUU i 1976 


GAACGACGUUUCUUUCUUU ! 1977 


GGAGUCAUCUCUUCCUAUA | 1978 




^■a Sense ■ -f- ',: ?• 


GGAGAAGCCUCACCUCUUG ! 1979 


GGAGAAAGGAGUCAAGAUU ! 1980 


GAGAACAACUACUCAUUGA ! 1981 


GAACUUCUCGUCACUGUAU | 1982 




i. ■ " : ' - SSense • Si" . /■ 




GGAUUGUUCUUUCUUCAUA 


1983 


GAAGGGUUCUCGUAUGUGA 


1984 


GAAGAAGGAUGUGGUGAUU 


1985 


GGACUGGGAUCGAACAACA 


1986 






• • ■ Ir-; S« 




GGACAGAAGUGGAAAUAUU 


1987 


UCAAAGAGGUGAACAUUAA 


1988 


GACCAAAUCUCAGUUGUUU 


1989 


GGAGAAUGGUGCUGUUUAA | 1990 




Sense 1 c.^ 




GAAGAGACCAAAGUAUAAU 


1991 


GAAGACCGGCCACGUCAUU 


1992 


GGAAGAGACCAAAGUAUAA 


1993 


GCAUUGAGAUUGACCAGAA 


1994 


UGAGAGAACGAGAAAGUUG 


1995 


GUGAAACCCUGUCUGCAUU 


1996 


GGAUCUCUCUCAACAACUA 


1997 


ACAACUAGGUGAACACAUA 


1998 




■ M- Sense" ' ' 1 f / ' ■ 




UGACAGUCCUGAAACGAUA 


1999 


GAUUGGAGAUUCUACAUUC 


2000 


GCUCAUGGAUGCAAAUCUU 


2001 


GAAGCUAAGCCGACCAUUU 


2002 




y -Yj Sense 




AAAGAGAGCUUAUCGUGAA 


2003 


GAUGAUAGGUUAGAAAUAG 


2004 


ACAAAGAAGUCAUGGAUUG 


2005 


GGAGCUGGAUCAUGAAAGA 


2006 




Sense 




GAAAAGGGAUGAUGGGAUU 


2007 


CCUAGACGAUGCCAAAUAU 


2008 


GAGCCAAACCAGGGAUUUA 


2009 


UGAAACGAAUGCUGGAGAA 


2010 


UCAGUCACCCAGAGAAAUA 


2011 



wo 2004/045543 PCT/US2003/036787 
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CCAAGAAAGCUAUCUCUGA 


2012 1 


AGACUCACCUAGAGCUAAA 


2013 




Sense . .. 




CCUGGAGGGUCCUGUACAA 


2014 


GAGCAAAUGAGCCAAACGU 


2015 


GGAGGGUCCUGUACAAUCU 


2016 


GACUUUCUCUGCACCAUGU 


2017 




Sense ' • 


1 


CCAGGGAGCUUGAAAGUUU 


2018 i 


AAAGUGCAGUUCAGUAAUA 


2019 


GAGAAUCACUAACCAGAGA 


2020 


GAGCCCAUCGCUAUUAUAA 


2021 




; ■ Hfijl'-iSense i$ 


:• ,1 • ■ .} 


GAGACGAGUUUAACGCUUA 


2022 


AAAGCAACCUUCUGAUGUA 


2023 


GGGAGAAGGUAGACAAUUG 


2024 


GCAAAGCAACCUUCUGAUG 


2025 


AGACAGAGGCACAAGGUAA 


2026 


GCAAGGAGGUUAGAGAAAU 


2027 


CAAGGAGGUUA6AGAAAUA 


2028 


UCUUACGACUGUUACGUUA 1 2029 




. f f"- Sense' 4 . 




GAAGACAUGAUCCGGAAUA 


2030 


CAACAGCGUUCCUAGAGAA 


2031 


GAAAUGGGAUGGACUGAAC 


2032 


ACGAUGAGCUGCAGACUGA 


2033 • 




'.'-Sense ■"^S . "^^ 




GAAAGAAGCCUGCAUAUAA 


2034 


GAAAUUGACUCUACAUUGU 


2035 


ACAAAUAGCACUUAGGUUA 


2036 


GAAUACACCUGUGGUUAAA 


2037 




• -^nse i^.v ^ 




GGAGAUGCCUGCCAUUAAA 


2038 


UCAAUGAUCUUGUGUUAGA 


2039 


GAAAGAACAUGUAAAGUGU 


2040 


GAAGAAAGAACAUGUAAAG 


2041 




■ Sense HS\ 




GUAGAUAGAUGGCAAUAUG 


2042 


GAGGAGGGGUAACUGAUUG 


2043 


GAGGAACCCUGGCAUGUAU 


2044 


GCACGGAUCUUUACUUUUG 


2045 




Sense- 




GGCGUAAGAUGAUGGAUUU 


2046 


6CAAAGGAAACCAACAAUA 


2047 


GCACAAAGGCAUUGUAAGU 


2048 


CAAAGGAAACCAACAAUAA 


2049 



BRCA1 [ 



Sense 



~ I 
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CCAUACAGCUUCAUAAAUA 


2050 


GAAGAGAACUUAUCUAGUG 


2051 


GAAGUGGGGUCCAGUAUUA 


2052 


GCAAGAUGGUGAUUGAUUA 


2053 


CCAUACAGCUUCAUAAAUA 


2054 




:; • ' ■ Sense : .• " . 




GAAAGUUAAUGUCAAGGAA 


2055 


GAGCAACACUGGCAUAACA 


2056 


UAACAGAGAUUUGCCUAAA 


2057 


GCGAAGAGCUGACCAAAUA 


2058 




■I ' ' .-i ^ ' Sense . . - ' 




CAAAGGGUUUCUCUGUUUA 


2059 


GAAAUGACCUCCCUAAGUU 


2060 


GAAGGCAGCUGGUAUAUUC 


2061 


GACAUGAUCUUCCUUCUGA 


2062 


GCACUCUUCUGUUCCCUUA 


2063 




S^nse m 


. i 


GUAUUAAACUCUCCUUUGA 


2064 


GCAAGGAGAUGUCUGAAUA 


2065 


CAACUUCCGUGAUCUUUAA 


2066 


GCUCAAAGAUGUAAUGUAG 


2067 




*U Sensel-W ' '' V- \ ■ l| •■ 


GAAGAAGGAGUCAGACAUU ! 2068 


AAACUAGGCGGUUGAAUGA | 2069 


GAUGGAACUUCGACUUUGU 2070 


GUAAACAGAUGGCACUUUG j 2071 




5 ~ Sense ' -^i-,. ' " 




GAUGUGUCCUCAUUAAUUU 


2072 


GAAGAGAGAUACAAGAUGA 


2073 


GAGCAUACCUGAAGAGAGA 


2074 


GCUAUGAAGUCCAGAAAUU 


2075 




■ ■ •. • ■ s ■• SSnsfe . ,. St- 




GUGAAUAUGUGAAAUAGUG " 


2076 


AAAGCAUGCUAGAGUAUGA 


2077 


UUAAGAAUGUGGAGAAGUA 


2078 


GAUAACAAGCUGACAGAUA 2079 




■ ' f ■ Sense • ' 




GGACGUAAUUGAUGAAGUA 


2080 


GCAGAUGGGUUCUUAAAUG 


2081 


CAAAUGAGGUUGAGGAAAU 


2082 


GGAAAUACCUGGAGGAUGA 


2083 




Sense 




GAUCCACGUUUCAAUGAUA 


2084 


GCAUUUAGGUGGAGACUUC 


2085 


GAUGUACGAUUAUAGUUUG 


2086 


UGAAUUAGAUCCACGUUUC 


2087 
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Sense .- ' , 




GCACAAGAAUGGAUCACAA 


2088 


GCUGAAACAUGCAGUUGUA 


2089 


GUACGUAGCAUGCAGAAUA 


2090 


GAACUUGCAUUGUGAUUGG 


2091 




.;■ / .J; • ■: - . • Sense . ■ - * \. 




GAAGCAUGACGGACAAGUA 


2092 


GAACAUUCCAGAGCGUGUA 


2093 


GUUCUUAGUUGCUGUAUGU 


2094 


CAUCAUGGUUGGCCUUAUC 


2095 




'Sense - . ' y:;vy^-|rfe?;-"--:'v#;i;''"^^ .y-- 


GGAACAAACUGGCAAAGCA | 2096 


GAUCAGAGCAGCAGUGAAA | 2097 


GGGCAAAACUGAAUUAUAA 


2098 


GAUCUCAGGUUCUCCUUGA 


2099 




Sense ' 


lUllM 


CUACAGAUCUCCAAUGAAA 


2100 


GCUACAAGCUGGAGAAUGA 


2101 


GGAAACAGCUAUGUGGAAA 


2102 


i GGAGUUGGAUCUCUCAGAA 


2103 




; . Sense 




GUAGUAAGGUAAACUGAAA 


2104 


GACAAUAUGUUCAAGAGAA 


2105 


GACAACAUCUCGUCAGCUA 


2106 


UAUAUGAUGUGUCCAGUAA 


2107 








CAAAGAAGCUCACUUACUG 


2108 


GAACAGCCCUAAAGUGGUU 


2109 


GAACAUGGAUGACCCUAAG 


2110 


GGGCAAAGCUAAAUCAAGU 


i 2111 



. ■ ' • Sense^P?-' . • lift 






GGAAAGCGAUGUGGAGAUU 


2112.- 


CCAAGACGGUGGCCUAUUU 


2113 




AAAGCGAUGUGGAGAUUUA 


2114 


GUGAGGAGCUUCCCUAUAG 


2115 



' ■ Seme' 




GAAUUCCUCUUGUCGAA6U 


2116 


GUUAUUAGCACCUUUAAGA 


2117 


GGAGGGAGGCCAUGACUUG 


2118 


CAGGAGAGCUCAAGAAUGG 


2119 


GGAUAUGGAUUUCAGUUAA 


2120 


GGAAGUCGGUGCCUUGGUU 


2121 


GGAAGGAGAGGACUGGUUU 


2122 




Sense . 




GCAGAUACUUGGACGGUUU 


2123 


GAGUAGAACUAGAAUGUGA 


2124 


GCGAAUAAAUCAUGAAAGA 


2125 


GAAGAUGGAAUUAUGCAUA 


2126 



wo 2004/045543 



144 



PCT/US2003/036787 



• " ' Sense 




GGCAAUGCGUCUCGCGAUA 


2127 


GAUCCGCAAUGAUGAGGAA 


2128 J 


GCAAUGGGUCUCGCGAUAA 


2129 


GAGGAACUCAAUAAGCUUU 


2130 




/ ' ;• Sense " "' ■ f 


AAUAGAAGCUGUGCAAUUA 


2131 


CAACUGACCUCAUCUGGAA 


2132 


GAAGGAAUCUGAUCU UAAU 


2133 


GGGAAGGGUUUCUCUAUUA 


2134 




Sense 




GGAGGUUGAUUGAGAAUUG 


2134 


GGCAAUAGGUGAGCGUUUA 


2135 


CAAAUACGCUUAGGUGUGU 


2136 


GGAGAUCGCCUACAAGUUC 


2137 





■ '-W&i'-- -l & 


GCAGAAACACUCCAAUUUA 


2138 


IGUAAAUACGUGAAUGCAUU 


2139 


IGCAGUGAACUUUUGAGAUA 


2140 


GAAGAACAGUCAUUUGAUG 


2141 



'lev i.- .-'Sense ' V 


■' 


GAGGUGAGCUGUUAAAUCA 


2142 


GCAGGGUGAUUUCCUAAUA 


2143 


GGGAGAAGAUAUUUAAUUG 


2144 


CAACUUCUCUCCUGAACUU 


2145 



V. Sense ' - 


■ V. ■l-'W' 


GGACACGGGACUGGACUUG 


2146 


GCACUUGGCUGUGAUUCAU 


2148 


iGAGACGAGGGCGAUGAAUA 


2149 


CAUGAACCCUUCCUGGAUU 


2150 



; Sense ^ 




GAACAUGGACUUGUAUAUU 


i 2151 


GAUGUGGGGUGAAAAGUUA 


2152 


GGACGAGAAAGAUCAUUGA 


! 2153 


AGGACGAGCUGCCCUAUGA 


! 2154 



"MC Sense ■ -n - 




GAAGGGAAGUUUCAGUAAC 


2155 


GGAAGGGAAGUUUCAGUAA 


2156 


GGAAACUGCUGCUGUGUAC 


2157 


GAACCAACCACUCAUGGAA 


2158 




Sense . 




GGGAACAGUUUGAAUAUAA 


2159 


GCAGUAGCCUGAAGCAGAA 


2160 


CGAGAAGGAUGCACAGAUA 


2161 


GCAAGAGGCUGAGAGGAAA 


2162 



NUP153 [ 



Sense ■. ^.t ' I 
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GAAGACAAAUGAAAGCUAA 


2163 




GAUAAAGACUGCUGUUAGA 


2164 


GAGGAGAGCUCUAAUAUUA 


2165 


GAGGAAGCCUGAUUAAAGA 


2166 




■— "■ ;/■'; • Sense ' | 


GAAAGAGCAUGAUGACAUA 


2167 




GAGGAGAGCUCUAUUAUGU 


2168 




GAAACUGAAUGGAAGAAUA 


2169 




AAAGAAGGCUGUACCGUUA 


2170 






-fek" . -Sense ■ !'"'^ '.;v ■ 5,"--" 


CUACAUGUCUUUGCUCUUA 


2171 




GCUAAGUCCUGUAAGAAUA 


2172 




CAAAGGCAAUGUACUGUUU 


2173 




GAACAAUGGUGGAUCCAAA 


2174 





> ' Sense " 




AAGUUCAGCUUCUCUAUUA 


2175 


GAAGAAAUCUCUGAUGGAU 


2176 


GAACACCUUUACUCUAUAA 


2177 


GCAUGGAGCUGGAGAACUA 


2178 



Sense 




GAUGAAAGCUCUAAAGAUG 


2179 


GAAAGGAGGUUCUAAACUA 


2180 


GGAAGAAGCUCAUUUGAUU 


2181 


GCAAAGAGGUGGCAGUUAA 


2182 



Sense 




GGAAGAAGAUCCACAUGAA 


2183 


GAACAUACUUUCAGAGCUU 


2184 


GAACAAUCUUUGCUGUAUA 


2185 


UAACAGAACUGGUUCAACA 


2186 









GAAGAGAUCCACACACAGU 


2187 


UGAAUGAGCUGCUGCACAU 


2188 


GAAGAUAGGUUUCGAUGUG 


2189 




GAAUUACCCUUCCUCAUCU 


2190 



■-■ s!;:"" , f-Aj_sense . . ■^ ■ 


^ - ■ ■ 1 


CUACAGAGCUUCAGUGUGA 


2191 1 


GAACAUAUUUGAUCGAAAG 


2192 t 


GAGCAAACCUAGAUAAGGA 


2193 


GCAUUGAUCUCUUCUUAGA 


2194 1 



Sense 




GGAAGACAGUGGUGAACUU 


2195 


GCAAAGCAUUGGAAUCAGA 


2196 


GAGCUGACGUGGAAGAUGA 


2197 


GAACGGGCCUGGAACCAUA 


2198 



TNFRSF 
6 


Sense 






GAUACUAACUGCUCUCAGA 


2199 
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GAAAGAAUGGUGUCAAUGA 


2200 


UCAAUAAUGUCCCAUGUAA 


2201 


UCAUGAAUCUCCAACCUUA 


2202 


GAUGUUGACUUGAGUAAAU 


2203 




Sense . .: 




GAAAGGAAAUGACUAAUGA 


2204 


GAAGAAGGCUGUUCAGAGA 


2205 


GGAAGUAGCUACGUUCUUU 


2206 


GGACAU AAG UGG AAAGAAG 


2207 




■ ■ ■ ■ . Sense i--:';' , ^ 




GAAAGAGUCCAUCAGAUUU 


2208 


CAAACUACAUUGGCAUUUA 


2209 


AAACAGACAUGGAUGGAUA 


2210 


CGAAAGGAAUGGUUAACUA 


2211 




: • Senses '.' ' '::-f:i- ■ 






2212 


GAAACUAUCUGGAUGUGUA 


2213 


GCACAAAGAUGGUAUCGUA 


2214 


GGAAAUGGCUGUGGUAACA 


2215 




M^^v ^^nse 






2216 


GCACAUGGGCUGCGUCUUU 


2217 


CCAGUGCGCUUCAAGAUGA 


2218 


GAACAUCUGCUUUGAGGUU 


2219 




Sense 




GGUAUUGCCUUGUGAAUUU 


2220 


GCAGAACAAUUACGAAUAG 


2221 


GUACAUAGCUGUUUGAAAU 


2222 


GGUGUAAACUUGUAGCAUU 


2223 



In addition, to identifying functional siRNA against gene families or 
pathways, it is possible to design duplexes against genes known to be involved in 
5 specific diseases. For example when dealing with human disorders associated with 
allergies, it will be beneficial to develop siRNA against a number of genes including 
but not limited to: 
the interleukin 4 recq)tor gene 

(SEQ. ED NO. 2224: UAGAGGUGCUCAUUCAUUU, 
1 0 SEQ. ID NO. 2225 : GGUAUAAGCCUUUCCAAGA, 
SEQ. ID NO. 2225: ACACACAGCUGGAAGAAAU, 
SEQ. ID NO. 2226: UAACAGAGCUUCCUUAGGU), 



the Beta-airestin-2 
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(SEQ. ID NO. 2227: GGAUGAAGGAUGACGACUA, 
SEQ. IDNO. 2228: ACACCAACCUCAUUGAAUU, 
SEQ. ID NO. 2229: CGAACAAGAUGACCAGGUA, 
SEQ. ID NO. 2230: GAUGAAGGAUGACGACUAU, ), 

5 

the interferon-gamma receptor 1 gene 
(SEQ. IDNO. 2231: CAGCAUGGCUGUCCUCUUU, 
SEQ. ID NO. 2232: GUAAAGAACUAUGGUGUUA, 
SEQ. ID NO. 2233: GAAACUACCUGUUACAUUA, 
10 SEQ. ID NO. 2234: GAAGUGAGAUCCAGUAUAA), 

the matrix metalloproteinase MMP-9 
(SEQ. ID NO. 2235: GGAACCAGCUGUAUUUGUU, 
SEQ. IDNO. 2236: GUUGGAGUGUUUCUAAUAA, 
15 SEQ. ID NO. 2237: GCGCUGGGCUUAGAUCAUU, 
SEQ. ID NO. 2238: GGAGCCAGUUUGCCGGAUA), 

the Slcllal (Nrampl) gene 

(SEQ. ID NO. 2239: CCAAUGGCCUGCUGAACAA, 
20 SEQ. ID NO. 2240: GGGCCUGGCUUGCUCAUGA, 
SEQ. ID NO. 2241 : GGGCAGAGCUCCACCAUGA, 
SEQ. ID NO. 2242: GCACGGCCAUUGCAUUCAA), 

SPINK5 

25 (SEQ. ID NO. 2243 : CCAACUGCCUGUUCAAUAA, 
SEQ. ID NO. 2244: GGAUACAUGUGAUGAGUUU, 
SEQ. DD NO. 2245: GGACGAAUGUGCUGAGUAU, 
SEQ. IDNO. 2246: GAGCUUGUCUUAUUUGCUA,), 

30 the CYP1A2 gene 

(SEQ. ID NO. 2247: GAAAUGCUGUGUCUUCGUA, 
SEQ. IDNO. 2248: GGACAGCACUUCCCUGAGA, 
SEQ. IDNO. 2249: GAAGACACCACCAUUCUGA, 
SEQ. ID NO. 2250: GGCCAGAGCUUGACCUUCA), 
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thymosin-beta4Y 

(SEQ. ID NO. 2251: GGACAGGCCUGCGUUGUUU, 
SEQ. ID NO. 2252: GGAAAGAGGAAGCUCAUGA, 
5 SEQ. ID NO. 2253: GCAAACACGUUGGAUGAGU, 
SEQ. ID NO. 2254: GGACUAUGCUGCCCUUUUG, 

activin A receptor IB 

(SEQ. ID NO. 2255: ACAAGACGCUCCAGGAUCU, 
1 0 SEQ. ID NO. 2254: GCAACAGGAUCGACUUGAG, 
SEQ. ID NO. 2255: GAAGCUGCGUCCCAACAUC, 
SEQ. ID NO. 2256: GCAUAGGCCUGUAAUCGUA, 
SEQ. ID NO. 2257: UCAGAGAGUUCGAGACAAA, 
SEQ. ID NO. 2258: UGCGAAAGGUUGUAUGUGA, 
15 SEQ. ID NO. 2259: GCAACAGGAUCGACUUGAG, 
SEQ. ID NO. 2260: GAAUAGCGUUGUGUGUUAU, 
SEQ. ID NO. 2261: UGAAUAGCGUUGUGUGUUA, 
SEQ. ID NO. 2262: GGGAUCAGUUUGUUGAAUA, 
SEQ. ID NO. 2263: GAGCCUGAAUCAUCGUUUA, ), 

20 

ADAM33 

(SEQ. ID NO. 2264: GGAAGUACCUGGAACUGUA, 
SEQ. ID NO. 2265: GGACAGAGGGAACCAUUUA, 
SEQ. ID NO. 2266: GGUGAGAGGUAGCUCCUAA, 
25 SEQ. ID NO. 2267: AAAGACAGGUGGCCACUGA), 

the TAP 1 gene 

(SEQ. ID NO. 2268: GAAAGAUGAUCAGCUAUUU, 
SEQ. ID NO. 2269: CAACAGAACCAGACAGGUA, 
30 SEQ. ID NO. 2270: UGAGAAAUGUUCAGAAUGU, 
SEQ. ID NO. 2271 : UACCUUCACUCGAAACUUA, 



COX-2 

(SEQ. ID NO. 2272: GAACGAAAGUAAAGAUGUU, 
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SEQ. ID NO. 2273: GGACUUAUGGGUAAUGUUA, 
SEQ. ID NO. 2274: UGAAAGGACUUAUGGGUAA, 
SEQ. ID NO. 2275: GAUCAGAGUUCACUUUCUU), 

5 ADPRT 

(SEQ. ID NO. 2276: GGAAAGAUGUUAAGCAUUU, 
SEQ. ID NO. 2277: CAUGGGAGCUCUUGAAAUA, 
SEQ. ID NO. 2278: GAACAAGGAUGAAGUGAAG, 
SEQ. ID NO. 2279: UGAAGAAGCUCACAGUAAA, ), 

10 

HDC 

(SEQ. ID NO. 2280: CAGCAGACCUUCAGUGUGA, 
SEQ. ID NO. 2281: GGAGAGAGAUGGUGGAUUA, 
SEQ. ID NO. 2282: GUACAGAGCUGGAGAUGAA, 
1 5 SEQ. ID NO. 2283 : GAACGUCCCUUCAGUCUGU), 



HnmT 

(SEQ. ID NO. 2284: CAAAUUCUCUCCAAAGUUC, 
SEQ. ID NO. 2285: GGAUAUAUCUGACUGCUUU, 
20 SEQ. ID NO. 2286: GAGCAGAGCUUGGGAAAGA, 
SEQ. ID NO. 2287: GAUAUGAGAUGUAGCAAAU), 

GATA-3 

(SEQ. ID NO. 2288: GAACUGCUUUCUUUCGUUU, 
25 SEQ. ID NO. 2289; GCAGUAUCAUGAAGCCUAA, 
SEQ. ID NO. 2290: GAAACUAGGUCUGAUAUUC, 
SEQ. ID NO. 2291: GUACAGCUCCGGACUCUUC), 

Gab2 

30 (SEQ. ID NO. 2292: GCACAACCAUUCUGAAGUU, 
SEQ. ID NO. 2293: GGACUUAGAUGCCCAGAUG, 
SEQ. ID NO. 2294: GAAGGUGGAUUCUAGGAAA, 
SEQ. ID NO. 2295: GGACUAGCCCUGCUGUUUA), and 
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STAT6 

(SEQ. IDNO. 2296: GAUAGAAACUCCUGCUAAU, 
SEQ. E) NO. 2297: GGACAUUUAUUCCCAGCUA, 
SEQ. ID NO. 2298: GGACAGAGCUACAGACCUA, 
5 SEQ. ID NO. 2299: GGAUGGCUCUCCACAGAUA). 

In addition, rationally designed siRNA or siRNA pools can be directed against 
genes involved in anemia, hemophila or hypercholesterolemia. Such genes would 
include, but are not be limited to: 
10 APOA5 

(SEQ. IDNO. 2300: GAAAGACAGCCUUGAGCAA, 
SEQ. ID NO. 2301 : GGACAGGGAGGCCACCAAA, 
SEQ. IDNO. 2302: GGACGAGGCUUGGGCUUUG, 
SEQ. ID NO. 2303: AGCAAGACCUCAACAAUAU), 

15 

HMG-CoA reductase 

(SEQ. ID NO. 2304: GAAUGAAGCUUUGCCCUUU, 
SEQ. ID NO. 2305: GAACACAGUUUAGUGCUUU, 
SEQ. ID NO. 2306: UAUCAGAGCUCUUAAUGUU, 
20 SEQ. ID NO. 2307: UGAAGAAUGUCUACAGAUA), 

NOS3 

(SEQ... ID NO. 2308: UGAAGCACCUGGAGAAUGA, 
SEQ. ID NO. 2309: CGGAACAGCACAAGAGUUA, 
25 SEQ. IDNO. 2310: GGAAGAAGACCUUUAAAGA, ■ 
SEQ. IDNO. 2309: GCACAAGAGUUAUAAGAUC), 

ARH 

(SEQ. ID NO. 23 1 0: CGAUACAGCUUGGCACUUU, 
30 SEQ. ID NO. 23 1 1 : GAGAAGCGCUGCCCUGUGA, 
SEQ. ID NO. 2312: GAAUCAUGCUGUUCUCUUU, 
SEQ. E) NO. 2313: GGAGUAACCGGACACCUUA), 



CYP7A1 
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(SEQ. ID NO. 2314: UAAGGUGACUCGAGUGUUU, 
SEQ. ID NO. 2315: AAACGACACUUUCAUCAAA, 
SEQ. ID NO. 2316: GGACUCAAGUUAAAGUAUU, 
SEQ. ID NO. 2317: GUAAUGGACUCAAGUUAAA), 

5 

FANCA 

(SEQ. ID NO. 2318: GGACAUCACUGCCCACUUC, 
SEQ. ID NO. 2319: AGAGGAAGAUGUUCACUUA, 
SEQ. ID NO. 2320: GAUCGUGGCUCUUCAGGAA, 
10 SEQ. ID NO. 2321 : GGACAGAGGCAGAUAAGAA), 

FANCG 

(SEQ. ID NO. 2322: GCACUAAGCAGCCUUCAUG, 
SEQ. ID NO. 2323 : GCAAGCAGGUGCCUACAGA, 
15 SEQ. ID NO. 2324: GGAAUUAGAUGCUCCAUUG, 
SEQ. ID NO. 2325: GGACAUCUCUGCCAAAGUC), 

ALAS 

(SEQ. ID NO. 2326: CAAUAUGCCUGGAAACUAU, 
20 SEQ. ID NO. 2327: GGUUAAGACUCACCAGUUC,: f 
SEQ. ID NO. 2328: CAACAGGACUUUAGGUUCA, 
SEQ. ID NO. 2329: GCAUAAGAUUGACAUCAUC), 

PIGA 

25 (SEQ. ID NO. 2330: GAAAGAGGGCAUAAGGUUA, 
SEQ. ID NO. 2331 : GGACUGAUCUUUAAACUAU, 
SEQ. ID NO. 2332: UCAAAUGGCUUACUUCAUC, 
SEQ. ID NO. 2333: UCUAAGAACUGAUGUCUAA), and 

30 factor VIII 

(SEQ. ID NO. 2334: GCAAAUAGAUCUCCAUUAC, 
SEQ. ID NO. 2335: CCAGAUAUGUCGUUCUUUA, 
SEQ. ID NO. 2336: GAAAGGCUGUGCUCUCAAA, 
SEQ. ID NO. 2337: GGAGAAACCUGCAUGAAAG, 
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SEQ. ID NO. 2338: CUUGAAGCCUCCUGAAUUA, 
SEQ. ID NO. 2339: GAGGAAGCAUCCAAAGAUU, 
SEQ. ID NO. 2340: GAUAGGAGAUACAAACUUU). 

5 Furthermore, rationally designed siRNA or siRNA pools can be directed 

against genes involved in disorders of the brain and nervous system. Such genes 
would include, but are not be limited to: 
APBBl 

(SEQ. ID NO. 2341: CUACGUAGCUCGUGAUAAG, 
10 SEQ. ID NO. 2342: GCAGAGAUGUCCACACGUU, 
SEQ. ID NO. 2343: CAUGAGAUCUGCUCUAAGA, 
SEQ. ID NO. 2344: GGGCACCUCUGCUGUAUUG), 

BACEl 

15 (SEQ. ID NO. 2345: CCACAGAGCAAGUGAUUUA, 
SEQ. ID NO. 2346: GCAGAAAGGAGAUCAUUUA, 
SEQ. ID NO. 2347: GUAGCAAGAUCUUUACAUA, 
SEQ. ID NO. 2348: UGUCAGAGCUUGAUUAGAA), 

20 PSBNl 

(SEQ. ID NO. 2349: GAGCUGACAUUGAAAUAUG, 
SEQ. ID NO. 2350: GUACAGCUAUUUCUCAUCA, 
SEQ. ID NO. 2351: GAGGUUAGGUGAAGUGGUU, 
SEQ. ID NO. 2352: GAAAGGGAGUCACAAGACA, 
25 SEQ. ID NO. 2353 : GAACUGGAGUGGAGUAGGA, 
SEQ. ID NO. 2354: CAGCAGGCAUAUCUCAUUA, 
SEQ. ID NO. 2355: UCAAGUACCUCCCUGAAUG), 

PSEN2 

30 (SEQ. ID NO. 2356: GCUGGGAAGUGGCUUAAUA, 
SEQ. ID NO. 2357: CAUAUUCCCUGCCCUGAUA, 
SEQ. ID NO. 2358: GGGAAGUGCUCAAGACCUA, 
SEQ. ID NO. 2359: CAUAGAAAGUGACGUGUUA), 



wo 2004/045543 PCT/US2003/036787 
153 

MASSl . 

(SEQ. IDNO. 2360: GGAAGGAGCUGUUAUGAGA, 
SEQ. ID NO. 2361: GAAAGGAGAAGCUAAAUUA, 
SEQ. ID NO. 2362: GGAGGAAGGUCAAGAUUUA, 
5 SEQ. ID NO. 2363: GGAAAUAGCUGAGAUAAUG, ), 

ARX 

(SEQ. ID NO. 2364: CCAGACGCCUGAUAUUGAA, 
SEQ. ID NO. 2365: CAGCACCACUCAAGACCAA, 
10 SEQ. ID NO. 2366: CGCCUGAUAUUGAAGUAAA, 
SEQ. ID NO. 2367: CAACAUCCACUCUCUCUUG) and 

NNMT 

(SEQ. ED NO. 2368: GGGCAGUGCUCCAGUGGUA, 
1 5 SEQ. ID NO. 2369: GAAAGAGGCUGGCUACACA, 
SEQ. ID NO. 2370: GUACAGAAGUGAGACAUAA, 
SEQ. ID NO. 2371: GAGGUGAUCUCGCAAAGUU). 

In addition, rationally designed siRNA or siRNA pools can be directed against 
20 genes involved in hypertension and related disorders. Such genes would include, but 
are not be limited to: 
angiotensin II type 1 receptor 

(SEQ. ID NQ..2-172: CAAGAAGCCUGCACCAUGU, ' 
SEQ. ID NO. 2373: GCACUUCACUACCAAAUGA, 
25 SEQ. ID NO. 2374: GCACUGGUCCCAAGUAGUA, 
SEQ. ID NO. 2375: CCAAAGGGCAGUAAAGUUU, 
SEQ. ID NO. 2376: GCUCAGAGGAGGUGUAUUU, 
SEQ. ID NO. 2377: GCACUUCACUACCAAAUGA, 
SEQ. ID NO. 2378: AAAGGGCAGUAAAGUUU), 

30 

AGTR2 

(SEQ. ID NO. 2379: GAACAUCUCUGGCAACAAU, 
SEQ. ID NO. 2380: GGUGAUAUAUCUCAAAUUG, 
SEQ. ID NO. 2381: GCAAGCAUCUUAUAUAGXJU, 
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SEQ. ID NO. 2382: GAACCAGUCUUUCAACUCA), and other related targets. 

Example Xni: Validation of Multigene Knockout using RabS and Eps 

Two or more genes having similar, overlapping functions often leads to 
genetic redundancy. Mutations that knockout only one of, e.g., a pair of such genes 
(also referred to as homologs) results in little or no phenotype due to the fact that the 
remaining intact gene is capable of fulfilling the role of the disrupted counterpart. To 
fully understand the function of such genes in cellular physiology, it is often 
necessary to knockout or knockdown both homologs simultaneously. Unfortunately, 
) concomitant knockdown of two or more genes is frequently difficult to achieve in 
higher organisms {e.g. mice) thus it is necessary to introduce new technologies dissect 
gene function. One such approach to knocking down multiple genes simultaneously 
is by using siRNA. For example, Figure 11 showed that rationally designed siRNA 
directed against a number of genes involved in the clathrin-mediated endocytosis 
5 pathway resulted in significant levels of protein reduction (e.g. >80%). To determine 
the effects of gene knockdown on clathfin-related endocytosis, internalization assays 
were performed using epidermal growth factor and transferrin. Specifically, mouse 
receptor-grade EOF (Collaborative Research Inc.) and iron-saturated human 
transferrin (Sigma) were iodinated as descaribed previously (Jiang, X., Huang, F., 
0 Marusyk, A. & Sorkin, A. (2003) Mol Biol Cell 1 4, 858-70). HeLa cells grown in 1 2- 

well dishes were incubated with '^I-EGF (1 ng/ml) or ^^^I-transferrin (1 ng/ml) in 
binding medium (DMEM, 0.1% bovine serum albumin) at 37°C, and the ratio of 
internalized' anid sm^ace radioactivity was determined during 5-min time course to 
calculate specific internalization rate constant ke as described previously (Jiang, X et 
15 al). The measurements of the uptakes of radiolabeled transferrin and EOF were 
performed using short time-course assays to avoid influence of the recycling on the 
uptake kinetics, and using low ligand concentration to avoid saturation of the clathrin- 
dependent pathway (for EOF Lund, K. A., Opresko, L. K., Sb:aibuck, C, Walsh, B. J. 
& Wiley, H. S. (1990) J. Biol. Chem. 265, 15713-13723). 



The effects of knocking down RabSa, 5b, 5c, Eps, or Eps 15R (individually) 
are shown in Figure 22 and demonstrate that disruption of single genes has littie or no 
effect on EOF or Tfii internalization. In contrast, simiiltaneous knock down of Rab5a, 
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5b, and 5c, or Eps and Eps 1 5R, leads to a distinct phenotype (note: total 
concentration of siRNA in these experiments remained constant with that in 
experiments in which a single siRNA was introduced, see Figure 23). These 
experiments demonstrate the effectiveness of using rationally designed siRNA to 
5 knockdown multiple genes and validates the utility of these reagents to override 
genetic redundancy. 

Example XIV. Validation of Multigene Targeting Using G6PD, GAPDH, PLK, 
and UQC. 

1 0 Further demonstration of the ability to knock down expression of multiple 

genes using rationally designed siRNA was performed using pools of siRNA directed 
against four separate genes. To achieve this, siRNA were transfected into cells (total 
siRNA concentration of lOOnM) and assayed twenty-four hours later by B-DNA. 
Results shown in Figure 24 show that pools of rationally designed molecules are 

1 5 capable of simultaneously silencing four different genes. 

Example XV. Validation of Multigene Knockouts As Demonstrated by Gene 
Expression Profiling, a Prophetic Example 

To further demonstrate the ability to concomitantly knockdown the expression 
20 of multiple gene targets, single siRNA or siRNA pools directed against a collection of 
genes (e.g. 4, 8, 16, or 23 different targets) are simultaneously transfected into cells 
and cultured for twenty-four hours. Subsequently, mRNA is harvested from treated 
(and untreated) cells andJabeled with one of two fluorescent probes dyes {e.g. a red 
fluorescent probe for the treated cells, a green fluorescent probe for the control cells.). 
25 Equivalent amounts of labeled RNA from each sample is then mixed together and 
hybridized to sequences that have been Unked to a solid support {e.g. a slide, "DNA 
CHIP"). Following hybridization, the slides are washed and analyzed to assess 
changes in the levels of target genes induced by siRNA. 

30 Example XVI. Identifying Hyperfunctional siRNA 

Identification of Hyperfimctional Bct2 siRNA 

The ten rationally desigoed Bcl2 siRNA (identified in Figure 13, 14) were 
tested to identify hyperpotent reagents. To accomplish this, each of the ten Bcl-2 
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siRNA were individually transfected into cells at a 300pM (0.3nM) concentrations. 
Twenty-four hours later, transcript levels were assessed by B-DNA assays and 
compared with relevant controls. As shown in Figure 25, while the majority of Bcl-2 
siRNA failed to induce functional levels of silencing at this concentration, siRNA 1 
5 and 8 induced >80% silencing, and siRNA 6 exhibited greater than 90% silencing at 
this subnanomolar concentration. 

By way of prophetic examples, similar assays could be performed with any of 
the groups of rationally designed genes described in Example VII or Example VIII. 
1 0 Thus for instance, rationally designed siRNA sequences directed against 
PDGFA 

(SEQ. ID NO. 2383: GGUAAGAUAUUGUGCUUUA, 
SEQ. ID NO. 2384: CCGCAAAUAUGGAGAAUUA, 
SEQ. ID NO. 2385: GGAUGUACAUGGCGUGUUA, 
1 5 SEQ. ID NO. 2386: GGUGAAGUUUGUAUGUUUA), or 

PDGFB 

(SEQ. ID NO. 2387: GCUCCGCGCUUUCCGAUUU, 
SEQ. ID NO. 2388: GAGCAGGAAUGGUGAGAUG, 
20 SEQ.IDN0.2389:GAACUUGGGAUAAGAGUGU, .:. 
SEQ. ID NO. 2390: CCGAGGAGCUUUAUGAGAU, 
SEQ. ID NO. 2391: UUUAUGAGAUGCUGAGUGA) 

could be introduced -intQiGells at increasingly limiting concentrations to determine 
whether any of the duplexes are hyperfiinctional. Similarly, rationally designed 

25 sequences directed against 
HIFl alpha 

(SEQ. ID NO. 2392: GAAGGAACCUGAUGCUUUA, 
SEQ. ED NO. 2393: GCAUAUAUCUAGAAGGUAU, 
SEQ. ID NO. 2394: GAACAAAUACAUGGGAUUA, 
30 SEQ. ID NO. 2395: GGACACAGAUUUAGACUUG), or 



VEGF 

(SEQ. ID NO. 2396: GAACGUACUUGCAGAUGUG, 
SEQ. ID NO. 2397: GAGAAAGCAUUUGUUUGUA, 
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SEQ. ID NO. 2398: GGAGAAAGCAUUUGUUUGU, 

SEQ. ID NO. 2399: CGAGGCAGCXJUGAGUUAAA) could be introduced into cells 
at increasingly limiting concentrations and screened for hyperflinctional duplexes. 

5 Example XVII: Gene Silencing: Prophetic Example 

Below is an example of how one might transfect a cell. 

a. Select a cell line. The selection of a cell line is usually determined by the 
desired application. The most important feature to RNAi is the level of 
expression of the gene of interest. It is highly recommended to use cell lines 
10 for which siRNA transfection conditions have been specified and validated. 



b. Plate the Cells. Approximately 24 hours prior to transfection, plate the cells at 
the appropriate density so that they will be approximately 70 - 90% confluent, 
or ^proximately 1 x lO' cells/ml at the time of transfection. Cell densities 

15 that are too low may lead to toxicity due to excess exposure and uptake of 

transfection reagent-siRNA complexes. Cell densities that are too high may 
lead to low transfection efficiencies and little or no silencing. Incubate the 
cells overnight. Standard incubation conditions for mammalian cells are 37°C 
in 5% CO2. Other cell types, such as insect cells, require different 

20 temperatures and CO2 concentrations that are readily ascertainable bypersons 

skilled in the art. Use conditions appropriate for the cell type of interest. 



c. SiRNA re-suspensi©oasA.dd 20 (aI siRNA universal buffer to each siRNA to 
generate a final concentration of SO pM. 

25 

d. SiRNA-lipid complex formation. Use RNase-firee solutions and tubes. Using 
the following table. Table XI: 
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e. 



Table XI 


■.;.„ . \ 


96-well 


24^eU ft 


Mixture 1 (TransIT-TKO-Plasniid dilution mixture) 


Opti-MEM 


9.3 lal 


46.5 111 


TranslT-TKO (1 


0.5 fil 


2.5 Hi 








t*.Mixftire 1 Final Yqluniti ^ 


mt ui ^ ■ 





Mixture 2 (siRNA dUution mixture) 


Opti-MEM 


9.0 Hi 


45.0 Ml 


siRNA (1 MM) 


1.0^1 


S.OmI 


. Rtixtutfe 2 Fjm.al Vbliimc 




f -5i5.0Mi^;' 




Mixture 3 (siRNA-Transfection reagent mixture) 


Mixture 1 


10 Ml 


50 Ml 


Mixture 2 


10 Ml 


50 Ml 


^ixdire S.Bial VoluiSe^ 


:^ zm ^ 






Incubate 20 minutes at room temperature. 


Mixture 4 (Media-siRNA/Transfection reagent mixture) 


Mixture 3 


20 Ml 


100 Ml 


Complete media 


80 Ml 


400 Ml 


V Mjjjlurev4:jinal'VoIumii. .^i 




soo;lii' 




Incubate 48 hours at 3fC. 



5 Transfection . Create a Mixture 1 by combining the specified amounts of OPTI-MEM 
serum free media and transfection reagent in a sterile polystyrene tube. Create a 
Mixture 2 by combining specified amounts of each siRNA with OPTI-MEM media in 
sterile 1 ml tubes. Create a Mixture 3 by combining specified amounts of Mixture 1 
and Mixture 2. Mix gently (do not vortex) and incubate at room temperature for 20 
10 minutes. Create a Mixture 4 by combining specified amounts of Mixture 3 to 

complete media. Add appropriate volume to each cell culture well. Incubate cells 
with transfection reagent mixture for 24 - 72 hours at 37°C. This incubation time is 
flexible. The ratio of silencing will remain consistent at any point in the time period. 
Assay for gene silencing using an appropriate detection method such as RT-PCR, 

1 5 Westem blot analysis, immunohistochemistry, phenotypic analysis, mass 



wo 2004/045543 PCTAJS2003/036787 
159 

spectrometry, fluorescence, radioactive decay, or any other method that is now known 
or that comes to be known to persons skilled in the art and that from reading this 
disclosure would useful with the present invention. The optimal window for 
observing a knockdown phenotype is related to the mRNA turnover of the gene of 
5 interest, although 24 - 72 hours is standard. Final Voliime reflects amount needed in 
each well for the desired cell culture fonnat. When adjusting volxifnes for a Stoc^' 
Mix, an additional 10% should be used to accommodate variability in pipetting, etc. 
Duplicate or triplicate assa}^ should be carried out when possible. 

1 0 While the invention has been described in coimection with specific 

embodiments thereof, it will be understood that it is capable of further modifications 
and this application is intended to cover any variations, uses, or adaptations of the 
invention following, in general, the principles of the invention and including such 
departure from the present disclosure as come within known or customary practice 

1 5 within the art to which the invention pertains and as may be applied to the essential 
features hereinbefore set forth and as follows in the scope of the appended claims. 



-20 
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laims 

A method for selecting siRNA comprising selecting an siRNA molecule of 
19-25 nucleoside bases, said method comprising: 

(a) selecting a target gene; 

(b) measuring the functionality of sequences of 19 - 25 nucleotides in 
length that are substantially complementary to a stretch of nucleotides 
of the target sequence, wherein said fiinctionality is dependent upon 
non-target specific criteria. 

The method according to claim 1 wherein said functionality is determined 
by applying one of the following formulas: 

Formula I = -(GC/3) +(AU,5.i9) -(Tm2o-c)*3 -(G,3)*3 -(C19) +(Ai9)*2 +(A3) 
+(Uio)+(A,4)-(U5)-(A„); 

Formula II = -(GC/3) -(AUi5.i9)*3 -(Gi3)*3 -(C19) +(Ai9)*2 +(A3); 
Formula ffl = -(GC/3) +(AUi5.i9) -(Tm20'c)*3; 

Formula IV = -(GC/2)+(AU,5.i9)/2-(Tm2o-c)*2 -(Gi3)*3 -(C,?) +(Ai9)*2 
+(A3) +(U,o)+(A,4)-(U5)-<Ai,); 

Formula V = -(Gi3)*3 -(Gi9).±(Ai.9-)*2 +(A3) + (Uio)+(A,4) -(Us) -(An); 
Formula VI = -(Gi3)*3 -(C19) +(Ai9)*2 +(A3); 

Formula VII = -(GC/2) +(AUi5-i9)/2 -( Tm2o°c)*l -(Gi3)*3 -(C,?) +(A,9)*3 
+(A3)*3 +(U,o)/2+(A,4)/2 -(U5)/2 -(An)/2; 



wherein in Formulas I - VII: 



AU15.19 = 0-5 depending on the number of A or U bases on the sense 
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strand at positions 15-19; 
Gi3 = 1 if G is the base at position 1 3 on the sense strand, otherwise its 
value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its 
5 value is 0; 

GC= the number of G and C bases in the entire sense strand; 
Tm 20-0= 1 if the Tm is greater than 20°C; 

A3 = 1 if A is the base at position 3 on the sense strand, otherwise its 
value is 0; 

10 All = 1 if A is the base at position 11 on the sense strand, otherwise its 

value is 0; 

Ai4 = 1 if A is the base at position 14 on the sense strand, otherwise its 
value is 0; 

Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its 
15 value is 0; 

Us = 1 if U is the base at position 5 on the sense strand, otherwise its 
value is 0; 

Uio = 1 if U is the base at position 10 on the sense strand, otherwise its 
value is 0; 

20 ... or. 

Formula Vffl: (44)*Gi343*Ai-12 WH-StIsJO*An-10*U4-10*C3-10*C5-10*C6- 
9*Aio-9*U9-9*C,g-8*Gio-7*Ui-7*Ui6-7*C,7-7*Ci9 
25 +7*Ui7+8*A2+8*A4+8*As+8*C4+9*G8+10*A7+10*Ui8+ll*Ai9+ 
ll*C9+15*Gi+ 18*A3+19*Uio-Tm-3* (GC,otoi)-6*(GCi5-i9)- 
30*X; and 

FonnulalX: (14.1)*A3+(14.9)*A6+(17.6)*Ai3+(24.7)*A,9+(14.2)*Uio+(10.5)* 
30 . C9+(23.9)*Gi+(16.3)*G2+(-12.3)*Ai,+(-19.3)*Ui+(-12.1)*U2+ 
(-1 l)*U3+(-15.2)*U,^(-l 1 .3)*Ui6+ (-1 1.8)*C3+(-17.4)*C6+(- 
10.5)*C7+ (-13.7)*Gi3+(-25.9)*Gi9-Tm-3* (GC,ottO-6*(GCi5-i9)- 
30*X 

wherein 
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Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 
A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 
A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 
A4= 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
5 As = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
Afi = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 
A7 = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 
Aio = 1 if A is the base at position 10 of the sense strand, othenvise its value is 0; 
An = 1 if A is the base at position 11 of the sense strand, otherwise its value is 0; 
10 Ai3 = 1 if A is the base at position 1 3 of the sense strand, otherwise its value is 0; 
Ai9 = 1 if A is the base at position 1 9 of the sense strand, otherwise if another base 
is present or the sense strand is only 1 8 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
15 C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 

C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 

Cfi = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
20 - Ci7 =^ 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 

Cis = 1 if C is the base at position 1 8 of the sense strand, otherwise its value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is on}5t,t8;i)ase pairs in length, its value is 0; 

25 Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
G2= 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio= 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
■ 30 Gi9= 1 if G is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
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Us = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
5 Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
Ui5 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
U16 = 1 if U is the base at position 1 6 on the sense strand, otherwise its value is 0; 
Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
U18 = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 

10 

GCi5_i9 = the number of G and C bases within positions 15 - 19 of the sense strand 

or within positions 1 5 -1 8 if the sense strand is only 1 8 base pairs in length; 
GQotai = the number of G and C bases in the sense strand; 
Tm= 100 if the targeting site contains an inverted repeat longer then 4 base pairs, 
1 5 otherwise its value is 0; and 

X = the number of times that the same nucleotide repeats four or more times in a 
row. 



A method of gene-silencing comprising selecting an siRNA according to 
claim 2 and introducing it into a cell. 



The method according to claim 3 wherein said introducing is by allowing 
passive uptake of the siRNA. -MR^^^asaa,- 



The method according to claim 3, wherein said introducing is through the 
use of a vector. 



6. A method for developing an siRNA algorithm for selecting siRNA, said 
method comprising: 
30 (a) selecting a set of siRNA;- 

(b) measuring the gene silencing abiHty of each siRNA from said set; 

(c) determining the relative fiinctionality of each siRNA; 

(d) determining the amount of improved functionality by the presence or 
absence of at least one variable selected from the group consisting of 
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the total GC content, melting temperature of the siRNA, GC content at 
positions 15-19, the presence or absence of a particular nucleotide at 
a particular position and the number of times that the same nucleotide 
repeats within a given sequence; and 
5 (e) developing an algorithm using the information of step (d). 

7. A method of selecting an siRNA with improved fimctionality, said method 
comprismg using the algorithm of claim 6. 

10 8. A method of selecting hyperfimctional siRNA, said method comprising 
using at least one ftmctional siRNA, wherein at least one said functional 
siRNA has been selected according to the method of claim 7 and 
measuring the silencing ability of said at least one ftmctional siRNA, 
wharein silencing ability is measured at a concentration of less than 1 

15 nanomolar siRNA. 

9. An siRNA molecule, wherein said siRNA molecule is effective at 
silencing Bcl-2. 

20 10. . The siRNA molecule of claun 9, wherein said siRNA molecule comprises 

a sequence substantially similar to a sequence selected from the group 

consisting of GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 

GAAGUACAUCCAUUAUAAG (SEQ*JDiNO..302); 

GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
25 AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 

UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 

CAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 

UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 

GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
30 GGAGAUAGUGAUGAAGUAG (SEQ. ID NO. 309); and 

GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 



1 1 . The siRNA molecule of claim 1 0, wherein said siRNA molecule 
comprises a sequence selected from the group consisting of 
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GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAG (SEQ. ID NO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ED NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
GGAGAUAGUGAUGAAGUAC (SEQ. ID NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 

12. The siRNA molecule of claim 1 1 j^herein said siRNA molecule 
comprises GCAUGCGGCCUCUGUUUGA . 

13. The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
a sense strand and an anti-sense strand. 

14. The siRNA molecule of claim 9, wherien said siRNA molecule comprises 
a hairpin. 

1 5. The siRNA molecule of claim 9, wherein said siRNA molecule comprises 
between 18 and 30 base pairs. 

16. A kit for gene silencing comprising at least one siRNA selected fix>m the 
group consisting of sequences substantially similar to the group consisting 
of GGGAGAUAGUGAUGAAGUA (SEQ. ID NO. 301); 
GAAGUACAUCCAUUAUAAG (SEQ. ID NO. 302); 
GUACGACAACCGGGAGAUA (SEQ. ID NO. 303); 
AGAUAGUGAUGAAGUACAU (SEQ. ID NO. 304); 
UGAAGACUCUGCUCAGUUU (SEQ. ID NO. 305); 
GCAUGCGGCCUCUGUUUGA (SEQ. ID NO. 306); 
UGCGGCCUCUGUUUGAUUU (SEQ. ID NO. 307); 
GAGAUAGUGAUGAAGUACA (SEQ. ID NO. 308); 
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GGAGAUAGUGAUGAAGUAC (SEQ. ED NO. 309); and 
GAAGACUCUGCUCAGUUUG (SEQ. ID NO. 310). 

17. A method of gene silencing comprising using the siRNA molecule of 
I claim 10. 

1 8. A method of gene silencing comprising using the siRNA molecule of claim 
11. 

) 19. A kit, wherein said kit is comprised of at least two siRNA, wherein said at 

least two siRNA comprise a first optimized siRNA and a second optimized 
siRNA, wherein said first optimized siRiNA and said second optimized 
siRNA are optimized according to one of the following formulas: 

5 Formula I = -(GC/3) +(AUis-i9) -Crm2o»c)*3 -(Gi3)*3 -(Cs) +(Ai9)*2 +(A3) 

+(U,o)+(Ai4)-<U5)-(Aii); 

Formula H = -(GC/3) -(AUi5-i9)*3 -(G,3)*3 -(C19) +(Ai9)*2 +(A3); 

0 Formula lU = -(003) +(AUi5-i9) -(Tm2o-c)*3; 

Formula IV = -(GC/2)+(AUis-i9)/2-(Tm2o°c)*2 -(G,3)*3 -(C19) +(Ai9)*2 
+(A3) +CU,o)+(A,4) -m -(A, ,); ■ -r^-r..^.. 

5 Formula V = -(Gi3)*3 -(€,9) +(Ai9)*2 +(A3) + (U,o)+(A,4) -{Uj) -(A, ,); 

Formula VI = -(Gi3)*3 -(C19) +(Ai9)*2 +(A3); 

Formula VII = -(GC/2) +(AUi5-i9)/2 -( Tm2o->c)* 1 -(Gi3)*3 -(€,9) +(Ai 9)*3 
\0 +(A3)*3 +(U,o)/2+(Ai4)/2-(U5)/2-(A„)/2;. 



wherein in Formulas I - VII: 
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AU15.19 = 0-5 depending on the number of A or U bases on the sense 

strand at positions 15-19; 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its 
5 value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its 

value is 0; 

GC = the number of G and C bases in the entire sense strand; 
Tm 2o°c = 1 if the Tm is greater than 20°C; 
10 A3 = 1 if A is the base at position 3 on the sense strand, otherwise its 

value is 0; 

Ai 1 = 1 if A is the base at position 11 Oil the sense strand, otherwise its 
value is 0; 

A]4 = 1 if A is the base at position 14 on the sense strand, otherwise its 
15 value is 0; 

Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its 
value is 0; 

U5 = 1 if U is Hbe base at position 5 on the sense strand, otherwise its 
value is 0; 

20 Uio = tif U is the base at position 1 0 on the sense strand, otherwise its 

value is 0; 



25 Formula VIII: (-14)*Gi3-13*Ai-12*U7-ll*U2-10*Aii-10*U4-10*C3-10*Cs-10*C6- 
9*A,o-9*U9-9*C,8-8*Gio-7*U,-7*Ui6-7*C,7-7*Ci9 
+7*U,7+8*A2+8*A4 +8*A5^-8*C4 +9*G8 +10*A7+10*Ui8+l 1 *Ai9+ 
ll*C9+15*Gi+ 18*Aj+19*U,o-Tm-3* (GCtou,i)-6*(GC,5_i9)- 
30*X; and 

30 

Formula IX: (14.1)*A3+(14.9)*A6+(17.6)*Ai3+(24.7)*Ai9+(14.2)*Uio+(10.5)* 
C9+(23.9)*G,+(16.3)*G2+(-12.3)*A„+(-19.3)*Ui+(-12.1)nj2+ 
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(-ll)*U3+(-15.2)*U,5+(-11.3)*Ui6+(-11.8)*C3+(-17.4)*C6+(- 
10.5)*C,+ (-13.7)*G,3+(-25.9)*G,9-Tm-3* (GQotai) - 6*(GCi5-i9)- 
30*X 

wherein 

5 Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 

A) = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 

A5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
10 A6 = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 

A7 = lif A is the base at position 7 of the sense strand, otherwise its value is 0; 

Aio= 1 if A is the base at position 10 of the sense strandV otherwise its value is 0; 

A] ] = lif A is the base at position 1 1 of the sense strand, otherwise its value is 0; 

Ai3 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
15 Ai9 = 1 if A is the base at position 1 9 of the sense strand, otherwise if another base 
is present or tiie sense strand is only 1 8 base pairs in length, its value is 0; 



C3 = 1 if C is the base at position 3 of the sense strand, othawise its value is 0; 

C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 
20 C5 = 1 if C is the base atpbsition 5 of the sense strand, otherwise its value is 0; 

Cs = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

Cg = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 

Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
25 Cig = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 18 base pairs in length, its value is 0; 



Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
30 G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 
Gio = 1 if G is the base at position 1 0 on the sense strand, otherwise its value is 0; 
Gi3 = 1 if G is the base at position 1 3 on the sense strand, otherwise its value is 0; 
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Gi9 = 1 if G is the base at position 19 of the sense strand, otherwise if another base 
is present or the sense strand is only 1 8 base pairs ui length, its value is 0; 



U) = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
5 U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
10 Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
U 15 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 
U 16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
U18 = 1 if.U is the base at position 18 on the sense strand, otherwise its value is 0; 

15 

GC15-19 = the number of G and C bases within positions 15 - 19 of the sense strand 

or within positions 1 5 -1 8 if the sense strand is only 1 8 base pairs in length; 
GCtotai= the number of G and Chases in the sense strand; 
Tm = 1 00 if the targeting site contains an inverted repeat longer then 4 base pairs, 
20 otherwise its value is Qj'and 

X = the number of times that the same nucleotide repeats four or more times in a row. 
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